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Certificate of Facsimile Transmission 
I hereby certify that this correspondence is being facsimile transmitted to the Commissioner for Patents, 
Box AF, Washington, D.C. 20231 on the date/iet fyth below. 

4-|w|03 By: 



Date of Signature 




Maria Laccotripe Zacharakis, PhJ). * 
Attorney for Applicant 
Limited Recognition Under 37 C.F.R. 10.9(b) 



DECLARATION PURSUANT TO 37 CPR §1.131 



Dear Sir: 

I, Rory A. J. Curtis, a citizen of the United Kingdom, residing at 78 Hardwick Road, 
Ashland, Massachusetts 01721 hereby declare as follows: 



Serial Number: 09/587,111 -2- W Group Ait Unit: 1646 

(1) I am the inventor of the subject matter described and claimed in the above-identified 
application. 

(2) Prior to January 22, 1999, the invention described and claimed in the above- 
referenced patent application was completed in this country, as evidenced by the following: 

(a) Prior to January 22, 1999 I had completed the sequencing of the foil length 
human VR-2 (SEQ ID NO:5) as evidenced by copies of the assembled contig map and the 
complete sequence of SEQ ID NO:5 derived from die contig map. The contig map and sequence 
print-out are submitted herewith as Exhibits A and B, respectively. 

(b) Prior to January 22, 1 999 I had determined that the polypeptide of SEQ ID 
NO:5 was a vanilloid receptor based on the results of a blast search using the amino acid 
sequence of SEQ ID NO:5. The top hit in this blast search analysis is rat vanilloid receptor 1 
(VR-1; accession number AF029310). Based on the percent identity and percent similarity 
between the polypeptide of SEQ ID NO:5 and rat VR-1, and the fact that rat VR-1 was the top hit 
in the blast search analysis I determined that the polypeptide of SEQ ID NO:5 was a vanilloid 
receptor. The results of the blast search analysis ?re submitted herewith as Exhibit C. 

(c) Prior to January 22, 1999, 1 had determined that the polypeptide of SEQ ID 
NO:5 represents a unique target for pain and that it may be responsible for hypersensitivity in 
chronic neuropathic pain, as evidenced by a copy of the "Qualified Target Summary Sheet" 
generated for this molecule. The Qualified Target Summary Sheet is submitted herewith as 
Exhibit D. 



Serial Number: 09/587,111 
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Group Art Unit: 1646 



(d) According to my routine practice, immediately after identifying the polypeptide 
of SEQ ID NO:5 as a vanilloid receptor (hVR-2) associated with pain and classifying it as a 
Qualified Target, I contemplated routine uses of this molecule such as: (i) use of the molecule in 
screening assays to identify modulators of the vanilloid receptor or (ii) use of this molecule in the 
diagnosis of conditions/diseases associated with, for example, aberrant vanilloid receptor (hVR- 
2) nucleic acid expression or activity, e.g, 3 pain disorders, as evidenced by a copy of the 
"Qualified Target Summary Sheet" generated for this molecule. 

(e) The dates deleted from the con tig map (Exhibit A), the sequence print-out 
(Exhibit B), the blast search results (Exhibit C), and the Qualified Target Summary (Exhibit D) 
are prior to January 22, 1999. 

I hereby declare that all statements made herein of my own knowledge are true and that 
all statements made on information and belief are believed to be true; and further that these 
statements are made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code and that such willful false statements may jeopardize the validity of this Application for 
Patent or any patent issuing thereon. 
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321 465 611 773 931 1,109 1,309 1,509 1,713 1,951 2,199 2,497 2,809* 
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□ Hole in oontig Diaaram K u 
— Single fragment piaqram * M 
Multiple fragments same direction 
Both strands 
Both ^ands plus 
' 1 1 i l l fe* Start oodon frame 1 
tfe^T^Stop codon frame 2 
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Exhibit B 
ZLella 

Segueaxher 01 *21fillraoe£inal* 



flbXc21ollz2 41 
flhXc31ellsl II 
Genbank N24» >#l> 
Genbank N28.- >#1> 
Qenbank N29- >ll> 
flhXc21elli3 >41> 

#1 



OQCTAGCCTG TCCTGACAGG GGAGAGTTAA OCTCCCGTTC TCCACCOTOC CGGCTGGCCA GGTGGGCTGA GGGTOACCGA GAQACCAQAA 
OGCTAGCCTG TCCTGACAGG GGAGAGTTAA OCTCCCGTTC TCCACCGTGC CGGCTGGCCA OGTGGGCTOA GGOTGACCGA OAGACCAGAA 
GCTAOCCTG TCCTGACAGG GGAGAGTTAA GCTCCCGT:C TCCACCGTGC CGGCTGNCiA GGTGGGCTGA GGOTGACCGA OAGACCAGAA 
GCTAGCCTG TCCTGACAGG G t AGAGTTAA OCTCCCGTTC TCCACCGTGC CGGCTGGC I A GGTGGGCTGA GGGTGACCGN GAGACCAQAA 
OACAGG GGAGAGTTAA OCTCCCGTTC TCCACCGTGC CGGCTGNC i A GGTGGGCTGA GGOTGACCGA GAGACCAQAA 

TCCACCGTGC CGOCTOOCCA GOTGGtfCTGA GGOTGACCGA NAGACCAGAA 



GGCTAGCCTO TCCTGACAGG GGAGAGTTAA GCTCCCOTTC TCCACCGTGC CGGCTGGCCA GGTGGGCTGA GGOTGACCGA GAGACCAQAA 
G . P V LTQ E S . APVL HRA GWP GGLR VTE R P E 
• • + * + + + 



flhXe21ellz2 191 
flbXc21edlzl #91 
Qenbank #90 
Genbank N280-. #90 
Qenbank N29L-. #77 
flhXc21elli2 #51 
flhKc21ellg2 >#i> 
£lhXc31ellgl >#l> 
flbJQlellrl9. >#1> 
flhX21ellrl9_ >ftl> 
flhXcfllellhl >ll> 

#91 



CCTQCTTGCT 
CCTGCTTGCT 
CCTGCTTGCT 
CCTGCTTGCT 
CCTGCTTGCT 
CNTGCTTGCT 



GQAGCTTAQT 
GGAGCTTAGT 
OGAQCTTAGT 
GGAGCTTAGT 
GGAGCTTAGT 
GAAGTTTAGT 



GCTCAGAGCT 
GCTCAGAGCT 
GCTCAGAGCT 
GCTCAGAGCT 
GCTCAGAGCT 
GTTCAGAGCT 
CAAAGCT 



GGGGAGGGAG 
G GO GAG OG AG 
GGGOANGGAG 
GGGGAGGGAG 
GGGGAGQGAG 
GGGAAGGGAG 
GGGAAOQAAG 



GTTCCGCCQC 
GTTCCGCCOC 
GTTCCGCCQC 
GTTCCGCCOC 
GTTCCGCCQC 
GTTCCGCNGC 
GTTCCGCNGC 
TTCCQCCGN 



TCCTCTGCTQ 
TCCTCTGCTG 
TCCTCTGNTG 
TCCTCTGCTG 
TCCTCTGCTG 
TCNTCTOCTG 
TCtlTCTQCTG 
TCWTCTGTTG 



TCAQCGCCOG 
TCAOCGCCGG 
TCA 

TCAiCGCCGG 
TCAlCGCCGG 
TCAQCQCCGG 
TCAOCGCCGG 
TCAGCGCiGG 



CAGCCCCTCC COOCTTCACT 
CAGCCCCTCC CQGCTTCACT 



CAGCCCCTCC 
CAGCCCCTCC 
CAGCCCCTCC 
CAGCCCCTCC 
CAGCCCNTCC 
AGCCCCTCC 
AOCCCCTCC 
AGCCCNTCC 



CQGCTTCACT 
CQGCTTCACT 
NGGCTTCACT 
NGGCTTCACT 
HGOCTTCACT 
CGGCTTCANT 
CQGCTTCACT 
NGGCTTCANT 



CCTGCTTGCT GGAGCTTAGT GCTCAGAGCT GGGGAGGGAG GTTCCGCCGC TCCTCTGCTG TCAQCQCCGG CAGCCCCTCC CQGCTTCACT 
PACW SLV L R A GBGG S A A PLL SAPA APP GFT 

♦ • * * • • + • + + + « +4 4. 



flhXC21ellz2 #181 
OhXc21oll3El #181 
Genbank K28- #180 
Genbank N29- #167 
flhxc31elll2 #141 
flbXc21ellg2 #66 
flJhXc21ellgl #50 
flhX21ellrl9srl #20 
flh«lellrl9g2 #20 
flhlc21ellhl #20 
OhX21ellrtU. >#l> 
£lhXd21ellbl >«l> 
flhxcaiellll >#1> 
flhXcilellbl >#1> 
flhXc21ellb2 >#!> 
jtbsal03«2b2 >ftl> 
jthsal03o2bl >«i> 
Jthsal03e2t2 >#1> 
jthsal03e2U >#1> 
Genbank AA4 W >#l> 
jthsal03e02t- >#1> 
flhX21ellr21_ ># i> 
flhX21ollr21- >#1> 
flhX31ellr21~ >#]> 



TCCTCCCGCA 


GCCCCTGCTA 


CTGAGAAGC 


CCC 


AQCAGCCGCC 


ACGCCCTGGC 


CTCAGCCTOC 


TCCTCCCGCA 


GCCCCTQCTA 


CTGAGAAGC 


ccc 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAOCCTGC 


TCCTCCCGCA 


GCCCCTQCTA 


CTGAQAAQCT 


CCGGGATCCC 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAGCCTGN 


TCCTCCCGCA 


GCCCCTGCTA 


CTGAGAAGCT 


CCGGGATCCC 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAGCCTOC 


TCCTCCCGCA 


GCCCNTGCTA 


CTGAGAAGC 


CCC 


AQCAGCCGCC 


ACGCCCTGGC 


CTCAOCCTGC 


TCCTCCCGCA 


GCCCCTGCTA 


TTGAGAAGC 


CCC 


AGCAGCCGCC 


ACGCCTTGG : 


CTCAGCCTGC 


TCCTCCCGCA 


GCCCNTGCTA 


TTQAQAAGC 


CCC 


AQCAGCCGCC 


ACGCCNTGGC 


CTCAQ : CTG : 


TCCTCCCGCA 


GCCCCTGCTA 


CTGAGAAGCT 


CCGGGATCCC 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAGCCTGC 


TCCTCCCGCA 


GCCCCTGCTA 


CTGAGAAGCT 


CCGGGATCCC 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAOCCTGC 


TCCTCCNGCA 


GCCCTTGNTA 


CTQAGAAQN 


iCC 


AACAGCCGCC 


ACOCCNCGG : 


CTCAOiCTGi 


CGCGCA 


GCCCCTGCTA 


CTGAGAAGCT 


CCGGGATCCC 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAOCCTGC 






TGAHAAGC 


CCC 


AGC AGCCNNC 


ACTCCNT GQC 


CNNAACCTGC 






TGAGAAGC 


CCC 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAGCCTGC 






TGAGAAGC 


CCC 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAGCCTGC 






TGAGAAGC 


CCC 


AQCAGCCGCC 


ACGCCCTGGC 


CTCAGCCTGC 






C 


ACGCGTCCOC 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAGCCTGT 






C 


ACGCGTCCGC 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAGCCTGT 






C 


ACGCGTCCGC 


AGCAGCCGCC 


ACGCCCTGGC 


CTCAGCCTGT 








ACGCGTCCGC 


AGCAGCCGCC 
GCAGCCQC 1 
GCAGCCGCC 


ACGCCCTGGC 
ACQCCCTGGC 
ACGCCCTGGC 
TQGC 


CTCAGCCTGT 
CTCAGCCTGC 
CTCAGCCTGT 
CTCAGCHTGC 



GGGOCTCCAG 
GGOGCTCCAG 
GNGGCTCCAG 
GGGOCTCCAG 
GGGOCTCCAG 
GGGOCTCCAG 
GGOGCTCCAG 
GGGOCTCCAG 
GGOGCTCCAG 
GGGQNTCCAG 
GGGOCTCCAG 
GOGQCTCCAN 
GGGOCTCCAG 
GGOGCTCCAG 
GGOGCTCCAG 
GGGOCTCCAG 
GGGOCTCCAG 
GGOGCTCCAG 
GGOGCTCCAG 
GTGGCTCCAG 
GGGOCTCCAG 
GGGOCTCCAG 
TCCAG 
AG 



TCAQGCCAAC 
TCAQGCCAAC 
TCAQGCCAAC 
TCAQGCCAAC 
TCAQGCCAAC 
TCAQGCCAAC 
TCAQGCCAAC 
TCAQGCCAAC 
TCAGOCCAAC 
TCAGOCCAAC 
TCAQGCCAAC 
NNNNNCCAAC 
TCAGOCCAAC 
TCAQGCCAAC 
TCAGOCCAAC 
TCAQGCCAAC 
TCAGOCCAAC 
TCAGOCCAAC 
TCAQGCCAAC 
TCAGOCCAAC 
TCAQGCCAAC 
TCAQGCCAAC 
TCAGOCCAAC 
TCAQGCCAAC 



#181 



TCCTCCCGCA GCCCCTGCTA CTGAGAAGCT CCGGGATCCC AOCAOCCQCC ACGCCCTGGC CTCAGCCTGC GGGOCTCCAG TCAGOCCAAC 
S S R 8 PCY . E A PQSQ QPP RPG LSLR OSS QAN 
«+ #+•++•««• • +♦« • ♦ -f + • ♦ + 4 + + ♦+ + 



21611a 

Sequfinchet™ "21ellraaefiiial < ' 



ft|flbXc21ellz2 1371 
9flhXc21ellZl §271 
9 Oenbonk N2B« #270 
§) Oenbank N29- 1257 
3flhJtc21elU2 «231 
8jflhXc31ellg2 1158 
?3eihXc21ellgl #140 
§)flhX21ellrl9-. #110 
9flWQlellrl9-, 1110 
28flKXc21ellhl #110 
28flfaX2lellr21v2 #87 
9flhxd21ellbl #70 
2j*lhXc21ellil §70 
3 flhXc21ellbl #70 
9flhXc21allb2 #70 
3 jthaa!03e2b2 #62 
9 jthsal03«2hl #62 
9 jthsal03e2t2 #62 
Si jthsal03e2tl #61 
§1 Genbonk AM 5- #50 
51 jthswl03e02tl~ #50 
9 flhX21ellr21a2 #35 
9 flhX21ellr21al #16 
9flhX21ellr21g2 #13 
S| Qenbank B20» >#1> 
3 GanbanJc HS0. >#l> 
3 Genbank B49- >#l> 
9 £UiX21ellr20„ >ll> 
^ flhX21ellr20^. >#l> 
3 flhJC21ellr22- >#1> 
3flhX21ellr22_ >#i> 
3 flhX21elir20-. >«1> 

#271 



ACCQACQCQC 
ACCQACQCGC 
ACCGACOCGC 
ACCQACGCGC 
ACCQACGCGC 
ACCOACOCOC 
ACCQACQCGC 
ACCQACQCQC 
ACCQACQCGC 
ACONACGiGC 
ACCQACQCGC 
ACCQACQCGC 
ACCQACQCGC 
ACCQACQCGC 
ACCQACQCQC 
ACCQACQCQC 
ACCQACGCGC 
ACCQACQCGC 
ACCQACGCGC 
ACCQACQCQC 
ACCQACGCGC 
ACCQACGNGC 
ACCQACGCGC 
ACCQACGNGC 



AGCTGGGAGQ 
AQCTQQGAQG 
AINTQQQAGG 
AiNTQGGAGQ 
AGCTQQQAQQ 
AGCTQQQAGQ 
AQCTQGGAGQ 
AGCTGGGAGQ 
AGCTQQQAQQ 
A j NTTGQAGN 
AGCTGGQAGG 
AGTTGGGNGG 
AGNTQGGAQG 
AGCTQQQAQQ 
AGCTQQQAQQ 
AGCTGGGAGQ 
AGCTGGGAGQ 
AGCTGGQAGG 
AGCTQQQAQG 
A l CTQQG AQQ 
AGCTGGQAGG 
AGCT: QQAGG 
AQCT : GQAQQ 
AGNTQGGAQG 



AAQACAQGAC 
AAAACANGAC 
AAQACAQGAC 
AAQACAQGAC 
AAQACAQGAC 
AAQACAQGAC 
AAQACAQGAC 
AAQACAQGAC 
AAGACAQQAC 
AAQACAQGAC 
AAQACAQGAC 
ANGACAGGAC 
AANACAGGAC 
AAGACAQQAC 
AAGACAQQAC 
AAQACAQGAC 
AAGACAQQAC 
AAGACAQQAC 
AAQACAQGAC 
AAQACAQGAC 
AAGACAQQAC 
AAGACAQQAC 
AAGACAQQAC 
AAQACAQGAC 



CCTTGACATC 
CCTTGACATC 
CCTTGACATC 
CCTTGACATC 
CCTTGACATC 
CCTTGACATC 
CCTTGACATC 
CCTTGACATC 
CCTTGACATC 
CCTTGACATC 
CCTTGACATC 
CCTTGACATC 
CCTTGACATC 
CCTTGACATC 
CCTTGACATC 
CCTTGACATC 
CCTTGACATC 
CCTTGACATC 
CCTTGACATC 
CCTTGACATC 
CCTTGACATC 
CCTTGACATC 
CCTTGACATC 
CCTTGACMTC 



TKCATNTQCA 
TKCATCTOCA 
TCCATCTQCA 
TCCATCTGCA 
TCCATCTQCA 
TCCATCTGCA 
TCCATCTQCA 
TCCATCTQCA 
TCCATCTQCA 
TCCATCTGCA 
TCCATCTQCA 
TCCATCTQCA 
TCCATCTGCN 
TCCATCTQCA 
TCCATCTQCA 
TCCATCTGCA 
TCCATCTGCA 
TCCATCTQCA 
TCCATCTQCA 
TCCATCTQCA 
TCCATCTQCA 
TCCATCTQCA 
TCCATCTGCA 
TCCATCTQCA 



CAAANGTCCT 
CANANGTCCT 
CA AQGTCCT 
CAOAGGTCCT 
CAQ AQGTCCT 
CAG AQGTCCT 
CAQ AQGTCCT 
CAG AQGTCCT 
CAG AQGTCCT 
CAHAGGTCCT 
CAQ AQGTCCT 
CAQAGQTCCT 
CAN AQGTCCT 
CAQAGQTCCT 
CAOAGGTCCT 
CAQAGQTCCT 
CAQAGQTCCT 
CAQAGQTCCT 
CAOAGGTCCT 
CAGAQGTCCT 
CAQAGQTCCT 
CAQAGQTCCT 
CAOAGGTCCT 
CAOAGGTCCT 
GAGAGGTCCT 
AO AQGTCCT 
AG AQGTCCT 



GQCTQQACCQ 
OGCTOGACCG 
OQCTOOACOA 
GQCTQQACCQ 
QQCTQOACCG 
GQCTGGACCG 
GGCTGGACCG 
GQCTQQACCQ 
GQCTQQACCQ 
GQCTQQACCQ 
GOCTOOACCG 
GQCTGGACCG 
GGCTGGACCG 
OGCTOGACCG 
QGCTOOACCQ 
GGCTGGACCG 
GGCTGGACCG 
GGCTGQACCG 
GGCTGGACCG 
QGCTQO : CCG 
GGCTGQACCG 
GQCTGGACCG 
OGCTOGACCG 
GONTQOACCG 
OGCTGOACCA 
GQCTQGACNC 
GGCTGQACAT 
Q 
Q 
Q 
O 
Q 



AGCAGCCTNC 
AGCAGCCTNC 
NQCAQCCTCC 
AGCAQCCTCC 
AOCAGCCTCC 
AOCAQCCTCC 
AOCAGCCTCC 
AOCAGCCTCC 
AOCAGCCTCC 
AOCAGCCTCC 
AOCAGCCTCC 
AGCAQNCTNC 
AOCAQCCTCC 
AOCAGCCTCC 
AOCAGCCTCC 
AOCAQCCTCC 
AOCAGCCTCC 
AOCAGCCTCC 
AGCAQCCTCC 
AOCAGCCTCC 
AGCAQCCTCC 
AGCAQCCTCC 
AGCAQCCTCC 
AOCAGCCTCC 
NOCAQCCTCC 
:GCAQCCTCC 
1 GCAQCCTCC 
AGCAOCCTCC 
AQCAOCCTCC 
AOCAGCCTCC 
AGCAQCCTCC 
AOCAGCCTCC 



TNCT 1 CTAQG 
TNCT : CTAQQ 
TCCTCC Q 
TCCTCC Q 
TCCTCCTAQQ 
TCCTCCTAQQ 
TCCTCCTAQQ 
TCCTCCTAQQ 
TCCTCCTAQQ 
TCCTCCTAQQ 
TCC TNCT AQQ 
TNCTC CTAQQ 
TCCTCCTAQQ 
TCCTCCTAGG 
TCCTCCTAQQ 
TCCTCC G 
TCCTCC G 
TCCTCC Q 
TCCTCC Q 
TCCTCC Q 
TCCTCC G 
TCCTCCTAGG 
TCCTCCTAQQ 
TCCTCCTAGG 
TCCTCC Q 
TCCTCC G 
TCCTCC Q 
TCCTCCTAGG 
TCCTCCTAGG 
TCCTCCTAQQ 
TCCTCCTAGG 
TCCTCCTAQQ 



ACCGACGCGC AGCTQGGAGG AAQACAQGAC CCTTGACATC TCCATCTQCA CAOAGGTCCT GGCTGQACCG AOCAQCCTCC TCCTCCTAGG 
TDAQ LOG RQD P . H L HLH R G P GHTE Q P P P P R 
+ * •••+♦+•+ + + + + • « +++ , 



21ella 

Sequencher*" w 21ellraoefinal" 



RflhXcaiellrt 1361 
RflhXc21«ll»l 1361 
£1 Qenbank N28~ 1360 

5 Oonbank M39 w #347 
R|flhXc31elli2 #321 
RflMEc21ellg2 #246 
QrihXc21ellol #230 
§|flhX2lellrl9„ #200 
HflW(21ellrl9... #200 
£1 £lhXc21elltal #200 
KfliiX21ellr2L- #177 
@flhXcJ21ellbl #160 
HflhXc21ollil #160 
?|flhXc21ellbl #160 
3flhXc21ellb2 #160 
Hjthsal03e2b2 #152 
£3jthsal03e2bl #152 
Qjthsal03e2t2 #152 
9 jthsal03e2tl #151 
£3 Genbank AA4- #140 
9 jtbsol03e02U #140 
9flhX21ellr2U. #125 
8flhX21ellr21_ #106 
3flhX21ellr21_ #103 
9 Qenbank K201~ #41 

6 Oeobank H504~ 140 
?9 Genbonk H491™ #40 
9 flhX21ellr20gl #22 
9 flhX21ellr20g2 #22 
^£lhX21ollr22A2 #22 
Bflljx21ellr22al #22 
3flhX21eUr20a2 #22 
9flhX21dllrrl» >#l> 
?8 fiwaieiine-. >#i> 

9*lfaX21ellrt8-. >#1> 
9 £lhX21ellrl6-. Ml> 
9flhX21ellrl8~ >#l> 
9 flhJQlelinS- >#1> 
9 flhXUellrlS-. >#1> 
9 flhX21ellr20-. >#l> 

Qmudieiiria. >«i> 

BjflfaX21ellrl7- >«l> 
$flhX21ellrl7_ >#1> 
9 flfaX31«Url7 M >#1> 
9flhX21«llrl5- >#1> 
3flhX21ollrl5- >#1> 
9nhx21ellcl8-. >#!> 
9flhX21allrl6- >«1> 
9flhX21ellrrU >«l> 
3flhX2lellrl7_ >#i> 
5| flhX21ellrt7_ >#l> 
9flfaX21ellrl6- >#1> 
9flhK21©llrl6- >#1> 
9 Oenbenk AA7-. >ffl> 
91 flhX21allr22.- >#1> 
9 Oenbank AA7- >#1> 
9nfaX21«llrl9~ >#1> 

9afc&2ieUri9 w >#i> 
9 Qenbank AXL. >#!> 



ATGACCTCAC CCTNCAGCTC TNCAOTTTTT AG TTGOAGA CANTA 

ATQACCTCAC CCTNCAACTC TTCAGTTTTT AAGNTGOAGA CATTATATOO AGGHCCAAAA NATOGCTCTN AGOCGGACAA AHGAAA I CTT 
ATGACCTCAA CCTCCAGCTC TCCAGTTTTC AGGTTOGAGA CATTAGATGG AGGCCAAGAA GATGGCTCTG AGGCGGACAG AOGAAAGCTG 
ATGACCTCAC CCTCCAGCTC TCCAGTTTTC AGGTTGGAGA CATTAGATGG AGGCCAAOAA GATGGCTCTG AGGCGGACAG AOGAAAGCTG 
ATGACCTCAC CCTCCAGCTC TCCAGTTTTC AGGTTGGAGA CATTAGATGG AGGCCAAOAA GATGGCTCTG AGGCGGACAG AOGAAAGCTG 
ATGACCTCAC CCTCCAGCTC TCCAGTTTTC AGGTTGGAGA CATTAGATGG AGGCCAAOAA GATGGCTCTG AGGCGGACAO AOGAAAGCTG 
ATGACCTCAC CCTCCAGCTC TCCAGTTTTC AGGTTGGAGA CATTAGATGG AGGCCAAOAA GATGGCTCTG AOOCGOACAG AOGAAAGCTG 
ATGACCTCAC CCTCCAGCTC TCCAGTTTTC AGGTTGGAGA CATTAGATGG AGGCCAAOAA GATGGCTCTG AGGCGGACAO AOGAAAGCTG 
ATGACCTCAC CCTCCAGCTC TCCAGTTTTC AGGTTGGAGA CATTAGATGG AGGCCAAOAA GATGGCTCTG AGGCGGACAG AOGAAAGCTG 
ATGACCTCAC CCTCCAGCTC TCCAGTTTTC AGGTTGGAGA CATTAGATGG AGGCCAAOAA GATOOCTCTG AOOCGOACAG AOGAAAGCTG 
ATGACCTCAC CCTCCAGCTC TCCAGTTTTC AGGTTGGAGA CATTAGATGG AOGCCAAOAA GATGGCTCTG AOGCGOACjO AOGAAAGCTG 
ATGACCTCAC CCTCCAGCTC TCCAOTTTTN AOGTTGGANA CATTAGATGG AGGCCAAOAA GATOOCTCTG AOOCGOACAG AOGAAAGCTG 
ATGACCTCAC CCTCCAGCTC TCCAGTTTTC AGGTTGGAGA CATTAOATOG AOOCCAAOAA GATGGCTCTG AOGCOGACAO AOGAAAGCTG 
ATGACCTCAC CCTCCAGCTC TCCAGTTTTC AGGTTOOAGA CATTAOATOG AOOCCAAOAA GATOOCTCTG AOOCGOACAG AOGAAAGCTG 
ATGACCTCAC CCTCCAGCTC TCCAGTTTTC AGGTTGG AG A CATTAGATGG AGGCCAAOAA GATGGCTCTG AOGCOOACAO AOGAAAGCTG 
ATGACCTCAC CCTCCAGCTC TCCAGTTTTC AGGTTOGAGA CATTAGATGG AGGCCAAGAA GATGGCTCTG AGGCGGACAO AOGAAAGCTG 
ATGACCTCAC CCTCCAGCTC TCCAGTTTTC AGO TTGOAGA CATTAGATGG AGGCCAAOAA GATGGCTCTG AGGCGGACAG AOGAAAGCTG 
ATGACCTCAC CCTCCAGCTC TCCAGTTTTC AGGTTGGAGA CATTAGATGG AGGCCAAGAA GATGGCTCTG AGGCGGACAG AOGAAAGCTG 
ATGACCTCAC CCTCCAGCTC TCCAOTTTTC AGGTTGGAGA CATTAGATGG AGGCCAAGAA GATGGCTCTG AGGCGGACAO AOGAAAGCTG 
ATGACCTCAC CCTCCAGCTC TCCAOTTTTC AGGTTGGAGA CATTAGATGG A : G : CAAGAA GATGGCTCTG AGGCGGACAG AOGAAAGCTG 
ATGACCTCAC CCTCCAGCTC TCCAOTTTTC AGGTTGOAOA CATTAGATNG AGGCCAAGAA AATGGCTCTG AGGCGGACAG AOGAAAGCTG 
ATGACCTCAC CCTCCAGCTC TCCAOTTTTC AGGTTGGAGA CATTAOATOG AGGCCAAGAA GATOOCTCTG AOGCOGACAO AGGAAAOCTG 
ATGACCTCAC CCTCCAGCTC TCCAOTTTTC AGGTTOOAGA CATTAOATOG AGGCCAAGAA GATGGCTCTG AOGCOGACAO AGOAAAGCTO 
ATGACCTCAC CCTCCAGCTC TCCAOTTTTC AGGTTOGAGA CATTAGATGG AOOCCAAOAA GATGGCTCTG AGGCGGACAG AOGAAAGCTG 
ATGACCTCAC CCTCCAOCTC TCCAOTTTTC AGGTTGG AG A CATTAGATGG AGGCCAAGAA GATGGCTCTG AGGCGGACAG AGGAAAOCTG 
ATGACCTCAC CCTCCAGCTC TCCAOTTTTC AGGTTOGAGA CATTAGATGG AGGCCAAGAA GATGGCTCTG AGGCGGACAO AGOAAAGCTO 
ATGACCTCAC CCTCCAGCTC TCCAGTTTTC AGGTTGOAOA CATTAOATGG AGGCCAAGAA GATGGCTCTG AGGCGGACAO AGGAAAOCTG 
ATGACCTCAC CCTCCAGCTC TCCAGTTTTC AGGTTGGGGA CATTAGATGG AOOCCAAOAA G ATG GCTC TO AGGCGGACAO AGGAAAOCTG 
ATGACCTCAC CCTCCAGCTC TCCAGTTTTC AGGTTOGGGA CATTAGATGG AGGCCAAGAA GATGGCTCTG AGGCGGACAG AGGAAAOCTG 
ATGACCTCAC CCTCCAGCTC TCCAGTTTTC AGGTTOOAGA CATTAGATGG AGGCCAAGAA GATOOCTCTG AGGCGGACAG AGGAAAOCTG 
ATGACCTCAC CCTCCAGCTC TCCAGTTTTC AGQTTOGAGA CATTAGATGG AGGCCAAGAA GATGGCTCTG AGGCGGACAG AGGAAAOCTG 
ATQACCTCAC CCTCCAOCTC TCCAGTTTTC AGOTTOGGQA CATTAGATGG AOOCCAAOAA GATOOCTCTG AOGCOGACAO AGGAAAOCTG 
CCCACGCGTC CGGTTGGAGA CATTAGATGG AGGCCAAGAA GATGGCTCTG AGGCGGACAG AGGAAAOCTG 
GTTTTC AGGTTOGAGA CATTAOATGG AGGCCAAOAA GATOOCTCTG AGGCGGACAG AGOAAAGCTO 
OTTTTC AGGTTOGAGA CATTAGATGG AGGCCAAGAA GATGGCTCTG AGGCGGACAO AGGAAAOCTG 
GTTTTC AGGTTGGAGA CATTAGATGG AGGCCAAOAA GATGGCTCTG AOGCOGACAO AGOAAAGCTO 
OTTTTC AGGTTGOAOA CATTAGATGG AGGCCAAGAA GATOOCTCTG AGGCGGACAO AGGAAAOCTG 
GTTTTC AGGTTOGAGA CATTAOATOG AGGCCAAGAA GATOOCTCTG AOGCOGACAO AOGAAAGCTG 
GTTTTC AGGTTOGAGA CATTAGATGG AOOCCAAOAA OATOGCTCTO AOGCOGACAO AGOAAAGCTO 
OTTTTC AGGTTOGGGA CATTANATGG AOGCCAAAAA NATGGCTTTN AGO CO Q AC AH AHGAAANCTG 
GTTTTC AGGTTGGAGA CATTAGATGG AGGCCAAGAA OATOOCTTTO AGGCGGACAG AGGAAAOCTG 
TC CGGTTGGAGA CATTAGATGG AOOCCAAOAA OATOGCTCTO AGGCGGACAG AGGAAAOCTG 
TC CGGTTGGAGA CATTAOATOG AGGCCAAGAA OATOGCTCTO AOGCOGACAO AGGAAAOCTG 
TC CGGTTGGAGA CATTAOATOG AGGCCAAGAA OATOGCTCTO AGGCGGACAO AGOAAAGCTO 
TC AGGTTGOAOA CATTAOATOG AGGCCAAGAA OATOGCTCTO AGGCGGACAG AGGAAAOCTG 
C AGGTTGGAGA CATTAOATOG AOOCCAAOAA OATOGCTCTO AOGCOGACAO AGGAAAOCTG 
C AGGTTGGAGA CATTAGATGG AOOCCAAOAA OATOGCTCTO AGGCGGACAG AGOAAAGCTO 
AGGTTOGAGA CATTAOATOG AOOCCAAOAA OATOOCTTTO AGGCGGACAO A t OAAAGCTG 
OG TTGOAGA CATTAGATGG AGGCCAAGAA GATGGCTCTG AGGCGGACAG AOGAAAGCTG 
GO TTGOAGA CATTAOATGG AOOCCAAOAA OATOGCTCTO AGGCGGACAO AOGAAAGCTG 
OG TTGOAGA CATTAOATOG AGGCCAAGAA OATOGCTCTO AGGCGGACAG AOGAAAGCTG 
GTTGOAOA CATTAOATOG AOOCCAAOAA OATOGCTCTO AGGCQGACAG AOGAAAGCTG 
G TTGOAGA CATTAGATGG AGGCCAAGAA OATOGCTCTO AGGCGGACAG AGGAAAOCTG 

A GATOOCTCTG AGGCGGACAO AOGAAAGCTG 
AOGCOGACAO AOGAAAGCTG 
OAAAGCTG 
AAAQNTG 
AAAGNTG 
AGCTO 



21ella 

Sequenaher* 11 ^lellraaefinal" 



jtha*103©2w2 >*1> 

TO 



#361 ATGACCTCAC CCTCCAOCTC TCCAOTTTTC AGGTTGGAOA CATTAGATGG AGGCCAAGAA OATGGCTCTG AGGCGGACAG AGGAAAGCTG 
MTSP SS6 PVP R L E T LDG OQB DOSE ADR OKI* 
* + • + •+ + •+••«,* , . ... * ■ . 
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flhXc21«llrl 1451 
Oenbank N26«. #450 
Oenbank N29~ #437 
flhXc21elli2 t411 
flhXc21ollg2 #338 
flbXc21ellgl #320 
flhX2lellrt9„ 1290 
flhX21ellrl9... #290 
flhXc21ellhl #290 
flhX21ellr21~ #267 
flhXd31ellbl #250 
flhXc21ellil #250 
£lhJCc21ellbl #250 
flhXc21ellb2 #250 
jthsal03e2b2 #242 
jthsal03e2bl #242 
Jthsal03e2t2 #242 
jtfasal03e2tl #241 
Oenbank ML #230 
jthsal03e02t~ #230 
flhX21ellr21_ #215 
flhX21ellr21~ #196 
flhX21ellr2L. #193 
Oenbank H20. #131 
Oenbank H50- #130 
Oenbank H49... #130 
flhX21ellr20~ #112 
flhX21ellr20-. #112 
flhX21ellr22_ #112 
flhX21ellr22^ #112 
flhX21ellr20_ #112 
flfcX21ellrrl3~ #71 
flbX21ellrl8v2 #67 
flhi21«llrievl #67 
flhX21ellrl6g2 #67 
flhX21ellrl8g2 #67 
flh»lellrl5g2 #67 
OhX21ellrlSgl #67 
flM21ellr20al #67 
eihX21ellrl8a2 #67 
£lhX21ellrl7v2 #63 
fUiX21ellrl7g2 #63 
flbX21ellrl7al #63 
OtdQlellrlSal #63 
£lhX21ellrl5a2 #62 
flhX21ellrl8al #62 
flh3aiellrl6a2 #61 
flhX2lellrrl3-. #60 
flbJQlellrl7vl #60 
£lhX21ellrl7a2 #60 
£lhX21ellrl6gl #59 
flh3Qlollrl6al #59 
Oenbank AA74- #32 
£lhX21«llr22gl #21 
Oenbank AA768* #9 
flfaX21«Url9a2 #6 
flhx21ellrl9al #8 
Oenbank All 2 6- #6 
jttoal03a2w2 #3 



GATTTTTGGA GCOGGCTGNC TTCNTTGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGNT 
GATTTTGGGA GCGGGCTGCN TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGG 

GATTTTGGGA NCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGtC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GA : TTTGGGA GCCGGCTG 
GATTTTGGGA GCGGGCTGCC T j CCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA NCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTNGGAA GCGGGCTGCC TCCCATGAAG 
GATTTTGGQA GCGGGCTGCC TCCCATGAAG 
GATTTTGGGA GCGGGCTGCC TCCCATGGAG 
GATTTTNGGA NCGGGCTGCC TCCCATGHAG 



TCACAGTTiC ACGGNGANGA CCGGNAATTC 

TCACAGTTCC AGGGCGAGGA CCGGAAATT 
TCACAGTTCC AGGGCGAGGA CCOGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 

TCACAGTTCC A 

TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATT 
TCACAGTTCC AGGGCGAGGA CCGGAAATT 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATT 
TCACAGTTCC AGGGCGAGGA CCGGAAATT 
TCACAGTTCC AGGGCGAGGA CCGGAAATT 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTNC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 

TCACAGTTNC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATT 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATT 
TCNCAGTTCA AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATTC 
TCACAGTTCC AGGGCGAGGA CCGGAAATT 
TCACAGTTCC AGGGCGAGGA CHONAAATTC 



CCCCC 



GCCCCTCAGA TAAGAGT 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 



GCCCCTCAGA TAAGAGTCAC TC 
GCCCCTCAAA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
CTCAGA TAAGAGTCAA CCTCAACTAC 
CTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
CTCAGA TAAGAGTCAA CCTCAACTAC 
CTCAGA TAAGAGTCAA CCTCAACTAC 
CTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAATCAA CCTTAACTAC 

GCCCCT 

GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAANAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 

CTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 

CTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CTTCAACTAC 

CTCAGA TAAGAGTCAA CCTCAACTAC 
GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 



21ella 

Seguencher <,< ^lellraoefinal* 



Rjth»al03e2xl >il> 
RflMQlellria- >#1> 
f§|fUiX21«llr31~ >#1> 
@fihJQl9llrl7 M >ll> 
(3 flWQlellr22-. >ll> 
0 nhxnellrrL- >»1> 
£9 Gaabank AM. >*1> 
@flhX21ellrrl- >•!> 
f§| flWQlellrrL.. >U> 
QfUUQlellrrt.- >«1> 
£3 Oenbonk AA8- >ll> 
@ Oenbaiik AA2- >♦!> 
@ jthaal03e2wX >ll> 



TTTGGGA GCGGNCTGCN TCCCATGHAG TCACAGTTCC AG ON CO AGO A CCGNAAATTC GCCCNTCAQA TAAGAGTCAC CCTCAACTAC 
TOCC TCCCATOOAO TCACAGTTCC AGGGCGAGGA CCGGAAATTC GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
AG TCACAGTTCC AGGNCGAGGA CCGGAAATTC GCCCCTCAGA TAAOAGTCAA CCTCAACTAC 
AG TCACAGTTCC AGGGCGAGGA CCGGAAATTC GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
TCACAGTTCC AGGGCGAGGW CCGGAAATTC GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
TCC A i GGCGAGGN CGGAAATTTC NCCCNT C AG A TAAAAGTCAC CCTCAACTKC 
AGGGCGAGGA CCGGAAATT CTCAGA TAAGAGTCAA CCTCAACTAC 

AGGNCGAAGA CCGGAAATTC GCCCNTCAQA TAAGAGTCAC CNTCANCTiC 
GGGCGAGGA CCGGAAATTN GCCCNTCAQA TAAGAGTCAA CCTCAACTAC 
GGC i AGO : NCGGAAATTC GCCCCTCAAA TAA t AGTCAA CCTCAACTNC 
GGA CCGGAAATT CTCAGA TAAGAGTCAA CCTCAACTAC 

TC GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 

TAC 



1451 GATTTTGGGA GCGGGCTGCC TCCCATGGAG TCACAGTTCC AGGGCGAGGA CCGGAAATTC GCCCCTCAGA TAAGAGTCAA CCTCAACTAC 
DPGS GLP PME SQPQ GED RKF APQI R V N L N Y 

♦ +♦ • + • + •+ • +• • + • + « + +« ••+«•+*+• + « **•*+« 
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S@nhXc21eUg2 #428 
K9flfaXc2lellgl #410 
JKtlhX2l©llrl9^ #380 
l@£lfcX21eltrl9-. #380 
f£DflbXc21ellhl 4380 
^£lhXc21ollbl #340 
¥3flhXc2lOllb2 #340 
J@jthsal03e2b2 #332 
KljthBal03e2bl #332 
Phthsal03e2t2 #332 
Pljthsal03e2tl #331 
@ Oenbonk AA4-. #320 
f3 JthS4l03o02t« #320 
[KflhX21ellr21-. #305 
^flhX21ellr2L. #286 
BfUtiQlellrtU. #283 
Oenbank H20_ #221 
Genbank H50-. #220 
£1 Oanbank K49~ #220 
@flhxaiellr20- #202 
f£3 £ lhX21ellr20- #202 
f£) flhX21ellr2Z- #202 
IKflhX21ellr22_ #202 
@OhX21ellr20-. #202 
El flMC21ellrrl-. #161 
£3flhX21ellrl6_ #157 
RfUUQlellrie-. #157 
BflMBlellrlS- #157 
@ahX21«llrl5. #157 
^3 £lhniellr20_ #157 
flhJQlellrl8™ #157 
i@nhX2l«llrl7^ #153 
HfliUQlellrl7-. #1S3 
f3 flMC21elin7_ #153 
|R mjQlellrlS-. #153 
i@flhX21ellrl5- #152 
B £lhX2lQllrl8^ #152 
£3 flhX21ellrl6.. #151 
BahJQleUrrL. #150 
S flhX21ellrl7-. #150 
H nhjaiellrl7« #150 
C3flfaX2lGllrl6-. #149 
fiflhX31ellrl6- #149 
B Genbank AA7 W #132 
Rflhfclellr22.- #U1 
H Genbank AA76-. #99 
Hflh3QloUrl9a2 #96 
9 eihX21«llrl94l #98 
H Geobank AI12. #96 
Rjthsal03e2v2 . #93 
Bjthsal03e2xl #88 
BfflbilellrlSgl #7S 
BohJCllellrtlgl #63 
RofaX21ellrl70l #63 
9<0h3OloUt22g2 #61 
9flfcX21ellrrl2_ #54 
H Oeabtnk AA46~ #51 
RflhX21<airrt3, #51 
3flhX21«Urrl3- #50 



CGAAAGGGAA 
COAAAGOOAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGA 
CGAAAGGGA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAA t GGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAAGGGAA 
CGAAA t GGAA 
CGAAAGGGAA 
CGAAAAOQAA 
NQAAAGGGAA 



CAGQTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 



TCAGCCGGAT 
TCAOCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 



CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 



TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTNCCCGAGA 
TTGACCGAGA 



TCGGCTCTNC 
TCGGCTCNWC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTN 
TCGGCTKTTT 



AATGCGGTCT 
AATOCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
ATTGCGGGCT 
AATGCGGGCT 



CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCGGGGGTGT 
CCCGGGGTGT 



CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCNAAGGAT 
CCCGAAGGAT 



CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGQTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGATGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAGGTGCCAG 
CAOGTCCCAG 
CAGGTGCCAG 
CAGGTCCCAG 
CAGGTGCCAG 



TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAOCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
HC AG CC GO AT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAGCCGGAT 
TCAOCCGGAT 
TCACCCGGNT 
TCA : NCGGAT 



CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAi 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CCAAACCGAT 
CNAAACCGAT 
CCtAANC AT 



TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCNAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TTGACCGAGA 
TNGACCGA0A 
TTGACCGAGA 
TTGACNGAGA 
TTGACCGAGA 



TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTT 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCQGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
A 

TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCQGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGCTCTTC 
TCGGNTTTTC 
TCGGCTCTTC 
TCGO j TTTTC 
TCGGCTCTTC 



AATGCGGTCT CCCGGGGTGT : CCCGAGGAT 
AATGCGGTCT CCCGGGGTGT CCCCGAGGAT 



AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCHGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 



CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
NCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 



CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCNAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
i CCCGAGGAT 
CCCCGAGNAT 
CCCCGANNAT 
CCCTOAGGAT 
CCCTGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCTGAGGAT 
CCCTGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 



AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATGCGGTCT 
AATCOGTTCT 
AATGCGGTCT 
AATCGGGTCT 
AATCOGTTCT 



CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGGGTGT 
CCCGGOOTGT 
CCGGGGGTGT 
CCCGGGGTGT 
CCNOGGGTGT 
CCGGGGGTGT 



CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCTAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGNAT 
CCCCGAGGAT 
CCCCQANGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCCGAGGAT 
CCCNGAGGAT 



21ella 

Sequaxfaet™ n 21ellraoefinal* 



(58fUOQXellrrl4-. #49 
3 Oanbank AA81- #44 
8 Oenhank AA23, #33 
3jthsal03e3wl #4 
?3flhX2lellrr9~ >#1> 
3|jthaal03e2x2 >#1> 
3 Genbank AI0~ >#1> 

IS41 



CQAAAGGGAA CAGGTGCCA: TCAACCGGAT CCAAACCOAT 

CQAAAGGGAA CAGGTGCCAG TCAGCCGGAT CCAAACCOAT 

CGAAAGG : AA CAGGTGCCAG TCAGCCGGAT CCAAACCOAT 

MQAAAGGGAA CAGGTGCCAG TCAGCCGGAT CCAAACCOAT 

AAAAGGAA CAGGTGCCAG TCAGCCGGAT CCAAACCOAT 

CAG TCAGCCGGAT CCAAACCOAT 



TTOACCOAOA 
TTOACCOAOA 
TTOACCOAOA 
TTOACCOAOA 
TTOACCOAOA 
TTOACCOAOA 



TiOOOTTTTC 
TCOOCTCTTC 
TCGGCTCTTC 
TCOOCTCTTC 
TOGOCTCTTC 
TCOOCTCTTC 



AATCOOOTCT 
AATOCOOTCT 
AATOCOOTCT 
AATOCOOTCT 
AATCGOTTCT 
AATOCOOTCT 
CT 



CCCQOOGTOT 
CCClGGGTOT 
CCCQOOGTOT 
CCCGGOOTOT 
CCHOQOGTOT 
CCCQOOGTOT 
CCCGGOOTOT 



CCCCOAOOAT 
CCCCOAGOAT 
CCC i OAOOAT 
CCCCOAOOAT 
CCCCOAOOAT 
CCCCOAOOAT 
CCCCOAOOAT 



CGAAAGGGAA CAOGTOCCAG TCAGCCGGAT CCAAACCOAT TTOACCOAOA TCGGCTCTTC AATOCOOTCT CCCOCGGTQT CCCCOAOOAT 

RKGT OAS QPD PNRF DRD RLF NAVS ROV 
++ * • • « • + •+« + +♦ + • ++• +• «• • •++« 



* • + • • 



P B D 
♦ ♦ ++ 



21ella 

Sequendbea^ tt 21ellcacefinal rt 



QnbXc31«llg2 1518 
@OhXc31ell0l 1500 
0mo21ellrl9 w #470 
@flhX2l©llrl9- 1470 
flflhXcaieUhl 1470 
9 flhXc21ellbl 1430 
9*lhXc21ellb2 1430 
@jthsal03e2t2 1433 
9jthsU03e2tl 1431 
9 jthfl4l03e02t- 1410 
9*lhmellr21_ 1395 
@n^Qlellr21 m §376 
9flhX21ellr2L- #373 
9 Genbank H20. «311 
9 Genbank KSOu #310 
@ Genbank H49- 1310 
9 flhX21ellr20- #292 
9 flhX2iellr20« #292 
@£lhX31ellr23 M #293 
9 ClbX21ellr22 M 4292 
9flhX21ellr20.. §292 
9 OWQlellrrL. 1251 
K)flhX21ellrl6_ #247 
@flhX21ellrie- #247 
9eih»lellrl5.. #247 
9 flhX21ellrl5„ #247 
9*lnX21ellr20- 4247 
3nbX3ieune. #347 

9 fthX21<airt7- #343 
KflhX21ollrl7- #243 
9*lnX21ellrl7~ 4243 
9 fUiX21ellrl5. 4243 
9 £lhX21«llrl5~. #242 
9 flhX21ellrie_ 4242 
9ftnX21ellrl6-. 4241 
9 eifaX21ellrl7. 4240 
S GhX21ollrl7_. 4240 
9 *lhX21ellrl6- 4239 
R nhX21ellrl6 M #239 
9 Genbank AA7« #212 
9 t lhX21eUr22- 4301 
9 Gonbank AA7- #189 
9 flhX21ellrl9.- #168 
9 flhX21«llrl9-. 4188 
9 Genbank Ail. #186 
9jthsal03e2w2 4183 
Bjthoal03o2xl 4176 
9 flhX2l9llrl8~ #165 
9flhX21ollr21- #153 
9 flhX21ollrl7-. #153 
9flhmellr22-. #151 
9nhX21ellrn_ #144 
9 Genbank AA4- #141 
9 flhX21ellrrL- #141 
BflhJQlellrrU 4140 
9flhX21allrrt_ 4139 
9 Genbank AA8-. #134 
9 Oeobaak AA2- 4123 
9jthft*103«2wl 494 



CTGGCTOOAC TTCCAGAQTA CCTGAGCAAG 
CTQGCTGQAC TTCCAGAQTA CCTGAGCAAG 
CTGGCTOOAC TTCCAGAQTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAQTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAQTA CCTGAGCAAG 
CTGNTTGG AC TT 

CTGGCTGGAC TTTCAGAGTA CCTGAGCCAG 
CTGGCTGGAC TTHCAGAGTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAQTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAQTA CCTGAGCAAG 
CTQGCTGQAC TTCCAGAQTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAQTA CCTGAGCAAG 
CTQGCTGQAC TTCCAGAQTA CCTGAGCAAG 
CTGQCTGGAC TTCCAGAQTA CCTGAGCAAG 
CTGQCTGQAT TTCCAGAQTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAQTA CCTGAGCAAG 
CTQGCTGQAC TTCCAGAGTA CCTGAGCAAG 
CTQGCTGQAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTQQAC TTCCAGAQTA CCTGAGCAAG 
CTGQCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTQGCTGQAC TT 

CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGQCTGGAC TTCCAGAQTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAQTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGQCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTGGAC TTTCAGAGTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGQCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGQCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTQQAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGQCTGGAC TTCCAGAQTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTQQAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTQQAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGQCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTGNAC TTCCANAGTA CCTGAGCAAG 
CTGQCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTGNAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTGGAC TTCCA AGTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGQCTGGAC TTCCAGAGTA CCTGAGCAAG 
CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG 



ACCAGCAAGT ACCTCACCGA CTCGGAATAC 
ACCAGCAAGT ACCTCACCGA CTCQGAATAC 
ACCAGCAAGT ACCTCACCGN CTCGGAATiC 
ACCAGCAAGT ACCTCACCGA CTCQGAATAC 
ACCAGCAAGT ACCTCACCGA CTCQGAATAC 

ACCAGCAAQT ACCTCiCCQA CTCQGAATAC 
ANCAGCAAGT 

ACCAGCAAGT ACCTCACCGA CTCQGAATAC 
ACCAGCAAGT ACCTCACCGA CTCQGAATAC 
ACCAGCAAGT ACCTCACCGA CTCQGAATAC 
ACCAGCAAGT ACCTCACCGA CTCQGAATAC 
ACCAGCAAGT ACCTCACCGA CTCQGAATAC 
ACCAGCAAGT ACCTiACCGA CTTGGAATAC 
ACCAGCAAGT AC 

ACCAGCAAQT ACCTCACCGA CTTGGAATAC 
ACCAGCAAGT ACCTCNCCGA CTCGGAA 
ACCAGCAAGT ACCTCACCGA CTCQGAAT 
ACCAGCAAGT ACCTCACCGA CTCQGAATAC 
ACCAGCAAGT ACCTCACCGA CTCQGAATAC 
ACCAGCAAGT ACCTCACCGA CTCQGAATAC 

ACCAGCAAGT ACCTCACCGA CTCQGAATAC 
ACCAGCAAGT ACCTCACCGA CTCQGAATAC 
ACCAGCAAQT ACCTCACCGA CTCGGAATAC 
ACCAGCAAQT ACCTCACCGA CTCGGAATAC 
ACCAGCAAQT ACCTCACCGA CTCGGAATAC 
ACCAGCAAGT ACCTCACCGA CTCGGAATAC 
ACCAGCAA 

ACCAGCAAGT ACCTCACCGA CTCQGAATAC 
ACCAGCAAGT ACCTCACCGA CTCGGAATAC 
ACCAGCAAGT ACCTCACCGA CTCGGAATAC 
ACCAGCAAGT ACCTCACCGA CTCGGAATAC 
ACCAGCAAQT ACCTCACCGA CTCQGAATAC 
ACCAGCAAGT ACCTCACCGA CTCGGAATAC 
ACCAGCAAGT ACCT t NCCGA CTCGGAATAC 
ACCAGCAAGT ACCTCACCGA CTCGGAATAC 
ACCAGCAAGT ACCTCACCGA CTCGGAATAC 
ACC AG C AAGT ACCTCACCGA CTCGGAATAC 
ACCAGCAAGT ACCTCACCGA CTCQGAATAC 
ACCAGCAAQT ACCTCACCGA CTCQGAATAC 
ACCAGCAAGT ACCTCACCGA CTCGGAATAC 
ACCAGCAAQT ACCTCACCGA CTCQGAATAC 
ACCAGCAAGT ACCTCACCGA CTCGGAATAC 
ACCAGCAAGT ACCTCACCGA CTCQGAATAC 
ACCAGCAAGT ACCTCACCGA CTCGGAATAC 
ACCAGCAAGT ACCTCACCGA CTCGGAATAC 
ACCAGCAAGT ACCTCACCGA CTCGGAATAC 
ACCAGCAAQT ACCTCACCGA CTCQGAATAC 
ACCAGCAAQT ACCTCACCGA CTCGGAATAC 
ACCAGCAAGT ACCTCNCCGN CTCG 
ACCAGCAAGT KCCTCACCGA CTCGGAATAC 
ACCAGCAAGT ACCTCACCGA CTCGGAATAC 
ACCAGCAAGT ACCTCACCGA CTCQGAATAC 
ACC AQC AAGT NCCTCACCGA CTCGGAATAC 
ACCAGCAAGT ACCTCACCGA CTCQGAATAC 
ACCAGCAAGT ACCTCACCGA CTCGGAATAC 
ACCAGCAAGT ACCTCACCGA CTCGGAATAC 
ACCAGCAAGT ACCTCACCGA CTCGGAATAC 



ACAGAGGGNT CNA 

AC AG AGOG NT CCA 
NCAGAGQG 
ACAQAGGGG 

ACAGAGQGCT CCACAGGTAA QACQTQCCTG 

ACAGAAGGCT TCACAAGTAA AACGTGCCTG 

ACAGANOGCT KCACANGTAA GACGTGCCTG 
ACAGAGQGCT CCACAGGTAA GACGTGCCTG 
ACAGAGQGCT CCACAGGTAA GACGTGCCTG 
ACAGAGQGCT CCACAGQTAA GACGTGCCTG 
ACAGAGQGC 

ACAGAGQGCT CCACAGGTAA GACGTGCCTG 
AC AG AGO GOT CCACAQGTAA GACQTOCCTG 

ACAGAGQGCT CCACAGGTAA QACQTQCCTG 
ACAGAGQGCT CCACAGGTAA GACGTGCCTG 
ACAGAGGGCT CCACAGGTAA GACGTGCCTG 

ACAGA 
ACAGA 
ACAGA 
NCAGAQ 

ACAGAGGGCT CCACAGGTAA GACGTGCCTG 
ACAGAGGGCT CCACAGGTAA GACGTGCCTG 

ACAGA 

ACAGAGGGCT CCACAGGTAA GACGTGCCTG 
ACAGAGGGCT CCACAGGTAA GACGTGCCTG 
ACAGAGGGCT CCACAGGTAA GACGTGCCTG 
ACAGAGGGCT CCACAGGTAA GACGTGCCTG 
ACAGAGGGCT CCACAGGTAA GACGTGCCTG 
ACAGANOGCT TCACANGTNA AACGTGCCTT 
ACAGAGGGCT CCACAGGTAA GACGTGCCTG 
NCNGA 

ACAGAGGGCT CCACAGGTAA GACGTGCCTG 
ACAGAGGGCT CCACAGGTAA GACGTGCCTG 
NCAGA 

ACAGAGGGCT CCACAGGTAA GACGTGCCTG 
ACAGAGGGCT CCACAGQTAA QACQTQCCTG 
ACAGAGGGCT CCACAGGTAA GACGTGCCTG 
ACAGAGQGCT CCACAGGTAA GACGTGCCTG 
ACAGAGGGCT CCACAGGTAA GACGTGCCTG 
ACAGAGGGCT CCACAGGTAA . GACGTGCCTG 
ACAQAGG 
ACAG 
ACAGA 

ACAGAGGGCT CCACAGGCAA GACGTGCCTG 
ACAGAGGGCT CCACAGGTAA GACGTGCCTG 
ACAGAGGGCT CCACAGGTAA GACGTGCCTG 
ACAGAGGGCT CCACAGQTAA GACGTGCCTG 
ACAGAGGGCT CCACAQGTAA GACGTGCCTG 
ACAGAGGGCT CCACAGGTAA GACGTGCCTG 
ACAGAGGGCT CCACAGGTAA GACGTGCCTG 
ACAGAGGGCT CCACAGGTAA GACGTGCCTG 



2le11a 

Sequencher*" ^lellraoefiiial* 



9flhX3l«llrr9c_. #89 
r9jths&103«2x2 174 
t9 Oenbank AI00- 123 
B*tfasal03e2al >«1> 
@jehaal03e2a3 >#1> 

1631 



CTGGCTGGAC TTCCAOAOTA CCTGAGCAAG ACCAGCAAOT ACCTCACCGA CTCGGAATAC AC AG AGOG CT CCACAGGCAA GACGTGCCTG 

CTOGCTGGAC TTCCAOAOTA CCTOAQCAAG ACCAGCAAOT ACCTCACCGA CTCGGAATAC AC AO AGOG CT CCACAGGTAA OACGTGCCTG 

CTGGCTGGAC TTCCAOAOTA CCTGAGCAAG ACCAGCAAOT ACCTCACCGA CTCGGAATAC ACAGAGGOCT CCACAGGTAA GACGTGCCTG 

CTCGGAATAC ACAGAGGOCT CCACAGGTAA GACGTGCCTG 

CTCGGAATAC ACAGAGGOCT CCACAGGTAA GACGTGCCTG 



CTGGCTGGAC TTCCAGAGTA CCTGAGCAAG ACCAGCAAOT ACCTCACCGA CTCGGAATAC ACAGAGGOCT CCACAGGTAA GACGTGCCTG 
LAOL P B Y h S K TSKY I*TD SKY TE 



B 



R|flhXc21ellhl 1560 
§) flhXc21ellb2 #520 
H Jthaal03e2tl 1511 
Qjthsal03e02t- #500 
9flhX21allr21_ 1465 
QflhX21ellrtU. #466 
9 Oenbank H2(L, #401 
9 Oenbank H49„ #400 
9 f lbX21ellr22- #362 
9flnX21ellr22- 1382 
9 flhX21ellr20-. #382 
9 flteQlellrtO. #337 
9afaX21ellrl8_ #337 
9 OhX21ellrl7. #333 
9 flhX21«llrl5-. #333 
9 flhX21eHirl5^ #332 
9 flnX21«llrie_ #332 
9flhX21ellrl6-. #331 
9 £lhX21ollrt7_ #330 
9flhX21«llrl7 M 1330 
9 £lnJ31ellrl6~ #329 
9 Genbonk. AA7_ #302 
9 Oenbank AA7- #279 
9 flhX21ellrl9~ #278 
9 flhX21ellrl9- #278 
9 Oenbank AIL. #276 
9 jtfa*al03e2w2 #273 
9jttwal03e2xl #268 
9flbX21«llrrl_ #234 
9 Oenbank AA4- #231 
9flhX21ellm. #231 
9nhX21ellrrl_ #230 
9cihX21allrrU #229 
9 Oenbank AAB. #224 
9 Oecbank AA2„ #213 
9jthsal03e2wl 1184 
9 flbX21ellrr9-. #179 
8jthsal03e2x2 #164 
.9 Oenbank AXO- #113 
9 3*thsal03e2al #41 
9jthsal03e2a2 #41 
9nhX21elim« >#1> 



ATiAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
AT OA A 

ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAANGCTT 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATOAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 
ATGAAGGCTG 



TOCT 
TOCT 

TGCTGAACCT TAAGGACNGA GTCAATGCCT GCATTNTOC 

TGCTGAACCT TAAGGACNGA GTCAATGCCT GCATTCTGCC 

TOCTGAAC 

TGCTGAACC 

TG 

TGCTGAACC 
T^GCTGAACC 
TGCTGAACC 
TGCTGAACC 
AGCTGAACC 
TGCTGAAC 
TGCTGAACC 
TGCTGAACC 
AGCTGAACC 
TGCTGAAC 
TGCTOAAC 
TGCTGAAC 
TGCTGAACC 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 
TGCTGAACCT 



ACTGCTGCAG ATCGACAGGO ACTNTGOCAA TCCTNAGCCC 



TAAGGACGGA 
TAAGGACGGG 
TAAGGACGGG 
TAAGGACGGG 
TAAGGACGGA 
TAAGGACGGA 
TAAGGACGGA 
TAAGGACGGA 
TAAGGACGGA 
TAAGGACGGA 
TAAGGACGGA 
TAAGGACGGG 
TAAGGACGGG 
TAAGGACGGG 
TAAGGACGGA 
TAAGGACGGA 
TAAGGACGGA 
TAAGGACGGG 
TAAGGACGGA 
TAAGGACGGA 



GTCAATGCCT 
GTCAATGCCT 
GTCAATOCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCHT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 
GTCAATGCCT 



GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGTC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGTC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 
GCATTCTGCC 



ACTGCTGCAG 
ACTGCTGCAG 
ACTGCTGCAG 
ACTGCTGCAG 
ACTGCTGCAG 
ACTGCTGCAG 
ACTGCTGCAG 
ACTGCTGCAG 
ACTGCTGCAG 
ACTGCTGCAG 
ACTGCTGCAG 
ACTGCTOCAO 
ACTGCTGCAG 
ACTGCTGCAG 
ACTGCTGCAG 
ACTGCTGCAG 
ACTGCTGCAG 
ACTGCTGCAG 
ACTGCTGCAG 
ACTGCTOCAO 



ATCGACAGGO 
ATCGACAOGG 
ATCGACCOGG 
ATCGACCGGG 
ATCGACAGGO 
ATCGACAOGG 
ATCGACAOGG 
ATCGACAGGO 
ATCGACAOGG 
ATCGACAGGO 
ATCGACAGGO 
ATCGACAGGO 
ATCGACAOGG 
ATCGACCGGG 
ATCGACAOGG 
ATCGACAGGO 
ATCGACAGGO 
ATCGACCGGG 
ATCGACAGGO 
ATCGACAOGG 
AOOG 



ACTCTGGCAA 
ACTCTGGCAA 
ACTCTGGCAA 
ACTCTGGCAA 
ACTCTGGCAA 
ACTCTOOCAA 
ACTCTGGCAA 
ACTCTOOCAA 
ACTCTGGCAA 
ACTCTGGCAA 
ACTCTGGCAA 
ACTCTGGCAA 
ACTCTGGCAA 
ACTCTGGCAA 
ACTCTOOCAA 
ACTCTGGCAA 
ACTCTGGCAA 
ACTCTGGCAA 
ACTCTOOCAA 
ACTCTGGCAA 
ACTRTGGCAN 



TCCTCAGCCC 
TCCTCAGCCC 
TCCTCAGCCC 
TCCTCAGCCC 
TCCTCAGCCC 
TCCTCAGCCC 
TCCTCAGCCC 
TCCTCAGCCC 
TCCTCAGCCC 
TCCTCAGCCC 
TCCTCAGCCC 
TCCTCAOCCC 
TCCTCAGCCC 
TCCTCAGCCC 
TCCTCAOCCC 
TCCTCAGCCC 
TCCTCAGCCC 
TCCTCAGCCC 
TCCTCAGCCC 
TCCTCAOCCC 
TCNTCAOCCC 



#721 ATGAAGGCTG TGCTGAACCT TAAGGACGGA GTCAATOCCT GCATTCTGCC ACTGCTOCAO ATCGACAGGO ACTCTGGCAA TCCTCAOCCC 
MKAV h U L KDG VNAC ILP LLQ I D R D SO 



P Q 
+ + 
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Seguaxher* 1 " n 21elLraoefinal 1 ' 



0 jtfaaalO3e02U 1590 
£3 Oanfaank AA7- 1392 
£3 Genbank AA7„ 1369 
@flhX31ellrl9 w 1366 
KUlhX21ellrl9„ #368 
0 Oeubcoik AIU. 1366 
IK jthsal03e2w2 1363 
Bjthsal03e2xl #358 
fHflhmellrrU. #324 
0 Genbank AM- 1321 
ISJflMQlellrrL- 1321 
QflttQlellrrL- #320 
£9 flhX21ellrrL- #319 
© Oenbank AA8~ #314 
Ooobank AA2- #303 
@jth&al03e2wl #274 
0 flWQlellrr*.- #269 
Kjthsal03e2x2 #254 
K Genbank AX0~ #203 
@jtbsal03o2al #131 
Edjthsal03e2&2 #131 
@£lhX21ellrrl4 w #25 

#811 



CTOOTAAATG 
CTOGTAAATG 
CTOOTAAATG 
CTOGTAAATG 
CTOOTAAATG 
CTOGTAAATG 
CTOGTAAATG 
CTOGTAAATG 
CTOGTAAATG 
CTOGTAAATG 
CTOGTAAATG 
CTOGTAAATG 
CTOGTAAATG 
CTOGTAAATG 
CTOGTAAATG 
CTOGTAAATG 
CTOGTAAATG 
CTOGTAAATG 
CTOGTAAATG 
CTOGTAAATG 
CTOGTAAATG 
CTOGTAAATG 



CCCAOTGCAC 
CCCAGTGCAC 
CCCAOTGCAC 
CCCAGTGCAC 
CCCAGTGCAC 
CCCAGTGCAC 
CCCAGTGCAC 
CCCAGTGCAC 
CCCAGTGCAC 
CCCAGTGCAC 
CCCAGTGCAC 
CCCAGTGCAC 
CCCAGTGCAC 
CCCAGTGCAC 
CCCAGTOCAC 
CCCAGTGCAC 
CCCAGTGCAC 
CCCAGTGCAC 
CCCAGTGCAC 
CCCAGTGCAC 
CCCAGTGCAC 
CCCAGTGCAC 



AGATGACTAT 
AGATGACTAT 
AGATGACTAT 
AQATOACTAT 
AGATGACTAT 
AGATGACTAT 
AGATGACTAT 
AGATGACTAT 
AGATGACTAT 
AGATGACTAT 
AGATGACTAT 
AGATGACTAT 
AGATGACTAT 
AGATGACTAT 
AGATGACTAT 
AGATGACTAT 
AGATGACTAT 
AGATGACTAT 
AGATGACTAT 
AGATGACTAT 
AGATGACTAT 
AGATGACTAT 



TACCOAO0NC 
TACCGAGOCC 
TACCOAOOCC 
TACCOAOOCC 
TACCGAGOCC 
TACCOAOOCC 
TACCOAOOCC 
TACCGAGOCC 
TACCGAGOCC 
TACCGAGOCC 
TACCGAGOCC 
TNCCGAGGCC 
TACCGAGOCC 
TACCOAOOCC 
TACCGAGOCC 
TACCGAGOCC 
TACCOAOOCC 
TACCGAGOCC 
TACCGAGOCC 
TACCGAGOCC 
TACCGAGOCC 
TACCGAGOCC 



ACAOC C 

ACAOCOCTCT 

ACAOCG i TCT 

ACAOCOCTCT 

ACAOCOCTCT 

ACAOCOCTCT 

ACAOCOCTCT 

ACAOCOCTCT 

ACAOCOCTCT 

ACAOCOCTCT 

ACAOCOCTCT 

ACAOCOCTCT 

ACAOCOCTCT 

ACAOCOCTCT 

ACAQCGCTCT 

ACAOCOCTCT 

ACAOCOCTCT 

ACAOCOCTCT 

ACAOCOCTCT 

ACAOCOCTCT 

ACAOCOCTCT 

ACAOCOCTCT 



OCACATCGCC 
OCACATCOCC 
OCACATCGCC 
OCACATCGCC 
OCACATCOCC 
OCACATCGCC 
OCACATCGCC 
GCACATCGTC 
OCACATCGCC 
OCACATCGCC 
OCACATCGCC 
GCACATCGCC 
GCACATCGCC 
GCACATCGCC 
GCACATCGCC 
GCACATCGTC 
GCACATCGCC 
GCACATCGCC 
GCACATCGCC 
GCACATCGCC 
GCACATCGCC 



ATT 
ATT 

ATTGAGAAGA 
ATTQAGAAGA 
ATTGAGAAGA 
ATTGAGAAGA 
ATTQAGAAGA 
ATTGAGAAGA 
ATT 

ATTGAGAAGA 
ATTGAGAAGA 
ATTGAGAAGA 
ATT 

ATTGAGAAGA 
ATTGAGAAGA 
ATTGAGAAGA 
ATTGAGAAGA 
ATT 

ATTGAGAAGA 
ATTGAGAAGA 
ATTGAGAAGA 



GGAGTCTGCA 
OGAGTCTOCA 
GGAGTCTGCA 
OGAGTCTOCA 
GGAGTCTGCA 
OGAGTCTOCA 



OTOTGTOAAO 
OTOTGTOAAO 
OTOTGTOAAO 
GTOTGTGAAG 
OTOTGTOAAO 
GTGTOCGAAO 



GGAGTCTGCA GNGTGTNAAG 
GGAGTCTGCA GTGTGTGAAQ 
OGAGTCTOCA GTOTGTGAAG 

OGAGTCTOCA GTOTGTGAAG 
OGAGTCTOCA GTGTGTGAAQ 
OGAGTCTOCA GTGTGCGAAG 
GGAGTCTGCA GTOTGTGAAG 

OGAGTCTOCA GTOTGTGAAG 
OGAGTCTOCA GTGTGTGAAG 
OGAGTCTOCA GTGTGTGAAG 



CTOGTAAATG CCCAGTGCAC AQATOACTAT TACCOAGGCC ACAOCOCTCT GCACATCGCC ATTGAGAAGA GGAGTCTGCA GTGTGTGAAG 
l# V N A QCT DDY YRGH SAL H I A IBKR SLQ CVK 



J 
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F?l flb30l6llrt9_. #458 
flfflhJaiellrlfc- 1456 
£3 Oehbank All- 1456 
@jthsal03e2w2 #453 
S9jthsal03e2xl 1448 
f8flhjaiellrrl. 1414 
£D £lhX21ellrrU. #411 
@ £lhX2l0llrrt~ #410 
0flhX2lellrrl.- #409 
£1 Oenbank AA2~ #393 
Kbthsal03e2wl #364 
@flfaX21eUrr9~. #359 
I3jtto«103e2x2 #344 
E3jtfas&103e2&l #231 
I@jthsal03e2a3 #231 
@ OfcXUellrrl.- #115 
£1 Crhobl2c4gl . „ >#1> 
flfrtiob012c04s- >#1> 
f| frtwb012c04s- >#1> 
@flhX31ellxha2 >#1> 
KS9flhX21allnRv2 >#1> 
IK flWQlollrl vl >«1> 
®frt»CUc4h3.~ >«i> 
@niiX31ellrlv2 >#1> 
@flhX31011zxnvl >tl> 
^nhX21ellrj&2 Hl> 
Oonbonk AA9- >f 1> 
@ Genbank AA9_ >#]> 



CTCCTOOTGO 
CTCCTOOTGO 
CTCCTOOTGO 
CTCCTOOTGO 
CTCCTOOTGO 
CTCCTOOTGO 
CTCCTOOTGO 
CTCCTOOTGO 
CTCCTOOTGO 
CTCCTOOTGO 
CTCCTOOTGO 
CTCCTOOTGO 
CTCCTOOTGO 
CTCCTOOTGO 
CTCCTOOTGO 
CTCCTOOTGO 



AGAATGGGGC 
AGAATGGGOC 
AOAATOOGGC 
AGAATGGGOC 
AGAATGGGOC 
AOAATOOGOC 
AOAATOGOOC 
AGAATGGGGC 
AGAATGGGOC 
AGAATGGGGC 
AGAATGGGOC 
AOAATOOGOC 
AGAATGGGOC 
AOAATOOGOC 
AOAATOOGOC 
AOAATOOGOC 



CAATGTGCAT 

CAATGTGCAT 

CAATG 

CAATGTGCAT 

CAATGTGCAT 

CAATOTOCAT 

CAATGTGCAT 

CAATGTGCAT 

CAATGTGCAT 

CAATGTGCAT 

CAATGTGNAT 

CAATGTGCAT 

CAATGTGCAT 

CAATGTGCAT 

CAATGTGCAT 

CAATGTGCAT 



GCCCOGGCCT QCGGCCGC 
OCCCOGGCCT GCGOCCOC 



GCCCOGGCCT 
GCCCOGGCCT 
OCCCOGGCCT 
GCCCOGGCCT 
GCCCGOOCCT 
OCCCOGGCCT 
GCCCOGGCCT 
GCCCOGGCCT 
GCCCOGGCCT 
OCCCQOOCCT 
GCCCOGGCCT 
GCCCOGGCCT 
OCCCOGGCCT 



GCGOCCOCT 

GCGOCCOCT 

OCGACCGCTT 

OCGOCCOCTT 

GCGOCCOCTT 

OCGOCCOCTT 

G 

GCGOCCOCT 
OCGACCOCTT 
GCGOCCOCT 
GCGOCCOCT 
GCGOCCOCT 
GCGOCCOCTT 
GCGOCCOCTT 
GCGOCCOCTT 
GCGOCCOCTT 
CGOCCOCTT 
OCCGCTT 
CCOCTT 
TT 



CTTCCAGAAQ OGCCAAOOGA CTTOCTTTTA TTTCGGTGAG 

NTTCCAOAAO OGCCAAOOGN CTTOCTTTTA TTTMOGTNAG 

CTTCCAGAAQ OGCCAAOOGA CTTOCTTTTA TTTCGGTGAG 

CTTCCAGAAQ OGCCAAOOGA CTTOCTTTTA TTTCOGTGAO 



CTTCCAGAAQ GGCCAAGGGA CTTOCTTTTA TTTCGGTGAG 



CTTCCAGAAQ 
CTTCCAGAAQ 
CTTCCAGAAG 
CTTCCAGAAQ 
CTTCCAGAAQ 
CTTCCAGAAG 
CTTCCAGAAQ 
CTTCCAGAAG 
TTCCAOAAG 
AGAAG 



GGCCAAGGGA 
OGCCAAOOGA 
OGCCAAOGOA 
OGCC t AOQGA 
OOCCAAGGGA 
GGCCAAGGGA 
GGCCAAGGGA 
GGCCAAGGGA 
GGCCAAGGGA 
GGCCAAGGGA 
AGOGA 



CTTOCTTTTA 
CTTGCTTTTA 
CTTOCTTTTA 
CTTOCTTTTA 
CTTOCTTTTA 
CTTOCTTTTA 
CTTOCTTTTA 
CTTOCTTTTA 
CTTOCTTTTA 
CTTOCTTTTA 
CTTOCTTTTA 



TTTCGGTGAO 
TTTCGGTGAG 
TTTCOGTGAO 
TTTCGGTGAG 
TTTCGGTGAG 
TTTCGGTGAO 
TTTCOGTGAO 
TTTCOQ : NAG 
TTTCGGTGAO 
TTTCGGTGAG 
TTTCOGTGAO 
TCGGTGAO 
TCGGTGAG 



#901 CTCCTGGTGG AGAATGGGGC CAATGTGCAT GCCCOGGCCT GCOGCCGCTT CTTCCAGAAQ GGCCAAGGGA CTTGCTTTTA TTTCOGTGAO 
LLVE NGA NVH A R A C GRP F Q K GQGT CFY FOE 

+ • + • + + •+ 



i 
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3) flhxaielim^ 1504 
S3flhX21allrrl~ 1501 
3 f lhttUllrrl... 1500 
9flfcX21ellrrU 1499 
^flfcX21ellrr9. 1449 
flflhmellrrl- 1205 
BfrhobiacAgl.a- #51 
§)frbob012c04sl~ 151 
RUrhdb012c04sl.. #51 
@)nfaX21ellrha2 #50 
P9nhX21ellnmr2 #48 
@ flhlQlellrlvl #47 
Rtxhod2c4h3.au #43 
£3 eihX21ellrlv2 #40 
£9 flhX21elliwl #36 
@ eihX31ellrja2 #26 
£3 Genbank AA992-. #9 
Genbank AA99S- #9 
P8flhX21ellrr9-. >#1> 
@ flhX21ellrr9~ >#1> 
S3jthKb051a01t-. Xl> 
^flhmellrrl. >#1> 
@OhX21ellrrl. >#1> 



CTACCCCTCT 
CTACCCCTCT 
CTACCCCTCT 
CTACCCCTCT 
CTACCCCTCT 
CTACCCCTCT 
CNACCCCTCT 
CTACCCCTCT 
CTACCCCTCT 
CTACCCCTCT 
CTACCCCTCT 
CTACCCCTCT 
CTACCCCTCT 
CTACCCCTCT 
CTACCCCTCT 
CTACCCCTCT 
CTACCCCTCT 
CTACCCCTCT 
CCCCTCT 
CCCCTCT 



CTTTGOCCOC 
CTTTGGCCGC 
CNTTGOCCGC 
CTTTGGCCGC 
CTTTGGCCGC 
CTTTGHCCGC 
CTTTGNCCGC 
CTTTGGCCGC 
CTTTNGCCGC 
CTTTGGCCGC 
CTTTGGCCGC 
CTTTGGCCGC 
CTTTGGCCGC 
CTTTGGCCGC 
CTTTGGCCGC 
CTTTGGCCGC 
CTTTGGCCGC 
CTTTGGCCGC 
CTTTGGCCGC 
CTTTGGCCGC 
CTTTGGCCGC 



TTGCACCAAG 

TTGCACCAAG 

TTGCACCAAG 

TTGCACCAAG 

TTGCACCAAG 

TTGCACCAAG 

TTGCACCAAG 

TTGCACCAAG 

TTGCACCAAG 

TTGCACCAAG 

TTGCACCAAG 

TTGCACCAAG 

TTGCACCAAG 

TTGCACCAAG 

TTGCACCAAG 

TTGCACCAAG 

TTGCACCAAG 

TTGCACCAAG 

TTGCACCAAG 

TTGCACCAAG 

TTGCNCCAAG 



CAGTGGGATG 
CAGTOOGATG 
CAGTGGGATG 
CAGTGGGATG 
CAGTGGGATG 
CAGTGGGATG 
CAGTOGNATG 
CAGTGGGATG 
CAGTGGGATG 
CAGTGGGATG 
CAGTGGGATG 
CAGTGGGATG 
CAGTGGGATG 
CAGTGGGATG 
CAGTGGGATG 
CAGTGGGATG 
CAGTGGGATG 
CAGTGGGATG 
CAGTGGGATG 
CAGTGGGATG 
CAGNGGGATG 



TGGTAAGCTA 
TGGTAAGCTA 
TGGTAAGCTA 
TGGTAAGCTA 
TGGTAAGCTA 
TGGTAAGCTA 
TGGTAAGCTA 
TGGTAAGCTA 
TGGTAAGCTA 
TGGTAAGCTA 
TGGTAAGCTA 
TGGTAAGCTA 
TGGTAAGCTA 
TGGTAAGCTA 
TGGTAAGCTA 
TGGTAAGCTA 
TGGTAAGCTA 
TGGTAAGCTA 
TGGTAAGCTA 
TGGTAAGCTA 
TGGTAAGCTA 
TA 
TA 



CCTCCTGOAG 

CCTCCTGGAG 

CCTCCTGOAG 

CCTCCTGGAG 

CCTCCTGGAG 

CCTCCTGGAG 

CCTCCTGGAG 

CCTCCTGGAG 

CCTCCTGGAG 

CCTCCTGGAG 

CCTCCTGGAG 

CCTCCTGGAG 

CCTCCT i GAG 

CCTCCTGGAG 

CCTCCTGGAG 

CCTCCTGGAG 

CCTCCTGGAG 

CCTCCTGGAG 

CCTCCTGGAG 

CCTCCTGGAG 

CCTMCTGGAG 

CCTCCTGGAG 

CCTCCTGOAG 



AACCCACACC 
AACCCACACC 
AACCCACACC 
AACCCACACC 
AACCCACACC 
AACCCACACC 
AACCCACNCC 
AACCCACACC 
AACCCACACC 
AACCCACACC 
AACCCACACC 
AACCCACACC 
AACCCACACC 
AACCCACACC 
AACCCACACC 
AACCCACACC 
AACCCACACC 
AACCCACACC 
AACCCACACC 
AACCCACACC 
AACCCACACC 
AACCCACACC 
AACCCACACC 



AGCCCOCCAG 
AGCCCHCCAG 
AGCCCGCCAO 
GGCCCGCCAG 
AGCCCGCCAO 
GGCCCGCCAG 
AGCCCGCCAO 
AOCCCGCCAG 
AGCCCGCCAO 
AGCCCGCCAO 
AOCCCGCCAG 
AGCCCGCCAO 
AGCCCGCCAO 
AGCCCGCCAO 
AGCCCGCCAO 
AOCCCGCCAG 
AGCCCGCCAO 
AOCCCGCCAG 
AGCCCOCCAG 
AGCCCOCCAG 
AGNCCGCCAG 
AOCCCGCCAG 
AGCCCGCCAO 



CCTOCAQOCC 
CCTGCAGGCC 
CCTGCAGOCC 
CCTGCAOGCC 
CCTGCAGGCC 
CCTGCAGGCC 
CCTGCAOGCC 
CCTGCAGGCC 
CCTGCAGGCC 
CCTGCAOGCC 
CCTGCAGGCC 
CCTGCAGGCC 
CCTGCAGGCC 
CCTGCAOGCC 
CCTGCAGOCC 
CCTGCAGOCC 
CCTGCAGGCC 
CCTGCAGGCC 
CCTGCAOGCC 
CCTGCAGGCC 
CCTGCAGGCC 
CCTGCAGGCC 
CCTGCAGGCC 



#991 CTACCCCTCT CTTTGGCCGC TTGCACCAAG CAGTGGGATG TGGTAAGCTA CCTCCTGGAO AACCCACACC AGCCCGCCAO CCTGCAGGCC 
LAA CTK Q W D V V S Y LLE NPHQ PAS LQA 
+ + + ++ + * +«+ + 



S 



£3flhX2ieiim-. #594 
£ahX31«llrrL #591 
^3 f lhX21ellrtrl~ #590 
#@flhX21«llrrl- #589 
#£3 nhXllellrrS- #539 
4f3jflhX31ellrrl_ #295 
^fxboibL2c4gl. M #141 
4@ Cxbab012c04s. #141 
#@£xbob012c04fl- #141 
^£lhX21ellrtu2 #140 

ffIbX21ellnnv2 #138 
@flhX2l6llrlvl #137 
^£xfaOCl2c4h3.» #133 

flbX21ellrlv2 #130 
^3)flhX21«llnHVl #126 
@flhX21ellrja2 #116 
Gcsfeank AA99-. #99 
4j£$ Genbank AA99- #99 
|RnhX21«llrr9al #88 
t@flttC21«llrr9a2 #88 
4^ jthKbOSlAOlU- #81 

flMQlellrrtl-. #43 
4@ ClhX21elimi«. #43 



ACTGACTCCC 
ACTOACTCCC 
ACTOACTCCC 
ACTOACTCCC 
ACTGACTCCC 
ACTOACTCCC 
. ANTGACTCCC 
ACTOACTCCC 
ACTOACTCCC 
ACTOACTCCC 
ACTGACTCCC 
ACTGACTCCC 
ACTGACTCCC 
ACTGACTCCC 
ACTGACTCCC 
ACTOACTCCC 
ACTGACTCCC 
ACTGACTCCC 
ACTOACTCCC 
ACTOACTCCC 
ACTOACTCCC 
ACTOACTCCC 
ACTGACTCCC 



AGGGCAACAC 

AGGGCAACAC 

AGGGCAACAC 

AGGGCAACAC 

AGGGCAACAC 

AGOGCAACAC 

AGGGCAACAC 

AGGGCAACAC 

AGGGCAACAC 

AGGGCAACAC 

AGGGCAACAC 

AGOGCAACAC 

AGGGCAACAC 

AGGGCAACAC 

AGOGCAACAC 

AGGGCAACAC 

AGOGCAACAC 

AGOGCAACAC 

AGOGCAACAC 

AGOGCAACAC 

AGGGCAACAC 

AGGGCAACAC 

AGGGCAACAC 



AGTC 

HGT 

AGT 

AOTCC 

AGTC 

AGTC 

AGTCCTGCAT 

AGTCCTGCAT 

AGTCCTGCAT 

AGTCCTGCAT 

AGTCCTGCAT 

AGTCCTGCAT 

AGTCCTGCAT 

AGTCCTGCAT 

AGTCCTGCAT 

AGTCCTGCAT 

AGTCCTGCAT 

AGTCCTGCAT 

AGTCCTGCAT 

AGTCCTGCAT 

AGTCCTGCAT 

AGTC 

AOTN 



OCCCTAGTGA 
GCCCTAGTOA 
OCCCTAGTGA 
GCCCTAGTOA 
GCCCTAGTOA 
OCCCTAGTGA 
GCCCTAGTOA 
GCCCTAGTOA 
OCCCTAGTGA 
GCCCTAGTOA 
OCCCTAGTGA 
GCCCTAGTOA 
OCCCTAGTGA 
OCCCTAGTGA 
OCCCTAGTGA 



TGATGTCGGK 
TGATCTCGGA 
TGATCTCGGA 
TGATCTCGGA 
TGATCTCGGA 
TGATCTCGGA 
TGATCTCGGA 
TGATCTCGGA 
TGATGTCGGA 
. TGATCTCGGA 
TGATCTCGGA 
TGATCTCGGA 
TGATCTCGGA 
TGATCTCGGA 
TGATCTCGGA 



CAANTCAOCT 
CAACTCAGCT 
CAACTCAGCT 
CAACTCAGCT 
CAACTCAGCT 
CAACTCAGCT 
CAACTCAGCT 
CAACTCAGCT 
CAACTCAGCT 
CAACTCAGCT 
CAACTCAGCT 
CAACTCAGCT 
CAACTCAGCT 
CAACTCAGCT 
CAACTCAGCT 



GAOAACATTG 

OAGAACATTG 

GAOAACATTG 

GAGAACA 

GAGAACA 

GAGAACA 

GAOAACATTG 

OAGAACA 

OAGAAC 

GAGAACA 

GAOAACATTG 

GAOAACATTG 

OAGAACA 

OAGAACA 

GAGAA 



CACTGOTGAC CAOCATGTAT 
CACTGGTGAC CAOCATOTAT 
CACTGOTGAC CAOCATGTAT 



CACTGGTGAC CAOCATOTAT 



CACTGOTGAC CAOCATGTAT 
CACTGGTGAC CAOCATOTAT 



#1081 ACTOACTCCC AGGGCAACAC AGTCCTGCAT GCCCTAGTOA TGATCTCGGA CAACTCAGCT OAGAACATTG CACTOGTOAC CAOCATGTAT 
TDSQ GNT VLH A L V. M ISD N S A E N I A LVT S M Y 



21ella 

Sequendher*" n 21ellracefinal* 



Qf£faofaUo4gl.« #231 
Rftfaob012c04&. 1331 
SI fttoob012c04s- 1231 
9frhocl2c4h3.~ 1223 
ft Oenbank AA9- #189 
ft Oenbank AA9- 1189 
0 jthlA029cl0t~ >#1> 
0 Oenbank M38*. >«!> 



QATOGGCTCC 
QATGOGCTCC 
OATGOGCTCC 
GATOGGCTCC 
OATGOGCTCC 
GATOGGCTCC 



TCCAAGCTGG 
TCCAAGCTGG 
TCCAAGCTGG 
TCCAAGCTGG 
TCCAAGCTGG 
TCCAAGCTGG 



GGCCCGCCTC 
GGCCCGCCTC 
GGCCCGCCTC 
GGCCCGCCTC 
GGCCCGCCTC 
GGCCCGCCTC 
CGCCTC 
CCTC 



TGCCCTACCG 
TGCCCTACCG 
TGCCCTACCG 
TGCCCTACCG 
TGCCCTACCG 
TGCCCTACCG 
TGCCCTACCG 
TGCCCTACCG 



TGCAGCTTGA 
TGCAGCTTGA 
TGCAGCTTGA 
TGCAGCTTGA 
TGCAGCTTGA 
TGCAGCTTGA 
TGCAGCTTGA 
TGCAGCTTGA 



GGACATCCOC 
GGACATCCOC 
GGACATCCOC 
GGACATCCOC 
GGACATCCOC 
GGACATCCOC 
GGACATCCOC 
GGACATCCOC 



AACCTNCAGO 
AACCTGCAGG 
AACCTGCAGO 
AACCTGCAGG 
AACCTGCAGG 
AACCTGCAGG 
AACCTGCAGG 
AACCTGCAOG 



ATCTCACOCC 

ATCTCACGCC 

ATCTCACOCC 

ATCTCACOCC 

ATCTCACq 

ATCTCACOCC 

ATCTCACOCC 

ATCTCACGCC 



TCTGAAGCTG 
TCTOAAGCTG 
TCTGAAGCTG 
TCTOAAGCTG 

TCTOAAGCTG 
TCTGAAGCTG 
TCTGAAANTG 



#1171 OATGOGCTCC TCCAAGCTOG OGCCCGCCTC TGCCCTACCG TOCAOCTTOA GGACATCCOC AACCTGCAOG ATCTCACGCC TCTGAAGCTG 
DOLL QAG A R L CPTV QLE DIR NLQD LTP LKL 



Jfrtobl2c4gl,- #321 
1 f rtiob012c04s_ #321 
Jfihob012c04iu. #321 
|frhocl2c4h3... §313 
) Oenbank AA9- #279 
\ JtKLa029clOtl_ #67 
| Genbank W386- 165 
9 Oenbank AA3 W >«!> 



OCCGCCAAOQ 
OCCGCCAAGO 
OCCGCCAAOO 
OCCGCCAAGO 
OCCGCCAAOO 
OCCGCCAAOO 
OCCGCCAAGO 



AGGGCAAGNT 
AGGGCAAOAT 
AOGOCAAGAT 
AGGGCAAOAT 
AGGGCAAOAT 
AGGGCAAOAT 
AOGOCAAGAT 



CGANATTTTC 
CGAOATTTTC 
CGAGATTTTC 
COAGATTTTC 
CGAGATCTTC 
COAGATTTTC 
CGAOATTTTC 



AGGCACATCC 
AGGCACATCC 
AGGCACATCC 
AGGCACATCC 
AGGCACATCC 
AGGCACATCC 
AGGCACATCC 



TOCA 

TGCAGCGGGA 
TGCAGCGGGA 
TGCAGCGGGA 
T 

TGCAGCGGGA 
TGCAGCGGGA 



GTTTTCAGOA CTGAGCCACC TTTCCCOAAA OTTCACCGAG 
GTTTTCAOGA CTGAGCCACC TTTCCCOAAA GTTCACCGAO 
GTTTTCAOOA CTGAGCCACC TTTCCCOAAA OTTCACCGAG 

GTTTTCAOGA CTGAGCCACC TTTCCCOAAA OTTCACCGAG 
GTTNTCAGGA CTOAOCCACC TTTCCCOAAA OTTCACCGAG 
TTCAGGA CTGAGCCACC TTTCCCOAAA OTTCACCGAG 



#1261 OCCGCCAAGO AOGOCAAGAT CGAOATTTTC AGGCACATCC TGCAGCGGGA GTTTTCAOOA CTGAGCCACC TTTCCCOAAA OTTCACCGAG 
AAKB OKI EIP RHIL QRB FSG LfiHL SRK PTE 



4ft f Ib0b012c04s~ #411 
4SfxfaobO12c04s« #411 
«f|fit>ocl2c4h3.- #403 
4ft jthLa029cl0t- #157 
4ft Oenbank W38- #155 
4ft Oenbank AA35-. #38 
4ft frtaobUc4cl ... >#1> 
4ft Oenbank N24- >#1> 



TGGTQCTATG 
TGGTGCTATG 
TGGTOCTATG 
TGGTGCTATG 
TGGTOCTATG 
TGGTGCTATG 



QGCCTGTCCG 
GGCCTGTCCG 
GGCCTGTCCG 
GGCCTGTCCG 
GGCCTGTCCG 
GGCCTGTCCG 
GGCCTGTNCG 



GGTGTCGCTG 
GGTGTCGCTG 
GGTGTCGCTG 
GGTGTCOCTG 
GGTGTCGCTG 
GGTGTCGCTG 
GOTO TCNCTG 



TATGACCTGG 
TATGACCT ; G 
TATGACCTGG 
TATGACCTGG 
TATGACCTGG 
TATGACCTGG 
TATGACNTGG 



CTTCTGTGGA 
CTTCTOTOGA 
CTTCTGTGGA 
CTTCTGTGGA 
CTTCTGTGGA 
CTTCTNTGGA 
CTTCTGTONA 



CAGCTOTGAG 
CAGCTGTOAG 
CAGCTGTOAG 
CAGCTOTGAG 
CAGCTGTOAG 
CAGCTOTNAG 
CAGCTGTGAN 



G AO AACTC AG 
OAGAACTCAA 
GAGAACTCAG 
OAOAACTCAO 
OAOAACTCAO 
GAGAACTCAG 
GAGAACTCAG 
OAOAACTCAO 



TOCTOGAGAT 
TGCTGGAGAT 
TOCTOGAGAT 
TOCTOGAGAT 
TOCTOGAGAT 
TGCTGGAGAT 
TOCTOGAGAT 
TGCTGGAGAT 



CATTGCCTTT 
CATTGCCTTT 
CATTGCCTTT 
CATTGCCTTT 
CATTGCCTTT 
CATTGCCTTT 
CATTGCCTTT 
CATTGMCTTT 



#1351 TGGTOCTATG GGCCTGTCCG GGTGTCGCTG TATGACCTGG CTTCTGTGGA CAGCTGTOAG OAOAACTCAO TOCTGOAGAT CATTGCCTTT 
WCYG PVR VSL YDLA SVD SCB EMSV LEI IAF 



4JBfrhob012c04s~ #501 
4ft £xtob012c04a~ #501 
^frnocUc4h3.. #493 
4ft jthL&029cl0t_ #247 
4ft Oenbank W38. #245 
4ft oenbank AA3- #128 
^Rfah6bl3o4cl.a-. #81 
4© Oenbank N242~ #31 



CATTOCAAOA 
CATTGCAAGA 
CATTOCAAOA 
CATTGCAAGA 
CATTOCAAOA 
CATTGCAAGA 
CATTGCAAGA 
CATTOCAAOA 



OCCCGCACCG 
ACCCGCACCQ 
GCCCGCACCO 
OCCCGHACCG 
OCCCGCACCG 
OCCCOCACCG 
OHCCGCACCG 
OCCCGHACCG 



ACACCGAATO 
ACACCGAATG 
ACACCGAATO 
ACACCGAATG 
ACACCGAATO 
ACACCOAATG 
ACACNGAATG 
ACACCGAATO 



OTCGNTTTGG 
GTCQNTT TOG 
GTCOTTTTGN 
GTCQTTTTGG 
GTCOTTTTGO 
OTCOTTTTOG 
GTCOTTTTGO 
GTCOTTTTGO 



AGCCCCTGAA 
AGCCCCTGAA 
AGCCCCTGAA 
AGCCCCTOAA 
AGCCCCTGAA 
AGCCCCTOAA 
AOCCCCTOAA 
AGCCCCTGAA 



CAAACTOCTG 
CAAACTGCTG 
CAAACTOCTG 
CAAACTGCTO 
CAAACTOCTG 
CAAACTGCTG 
CAAACTOCTG 
CAAACTGCTG 



CANGCGAAAT 
CAGGCGAAAT 
CANGCGAAAT 
CAGGCGAAAT 
CAGGCGAAAT 
CAONCGAAAT 
CAOOCOAAAT 



G GO ATCTGC T 
OOOATCTOCT 
GOGATCTOCT 
OOOATCTOCT 
OGGATCTGCT 
G GO ATCTGC T 
GOGATCTOCT 



CATjCCCAAG 

CATCCCCAA 

CATKCNCAAG 

CATCCCCAAG 

CATCCCCAAG 

CATCCCCAAG 

CATCCCCAAG 



#1441 CATTGCAAGA OCCCGCACCG ACACCGAATG GTCGTTTTGG AGCCCCTOAA CAAACTGCTO CAGGCGAAAT OOOATCTOCT CATCCCCAAG 
HCKS PHR HRM VVLB PLN KI-L QAKW DLL IPK 



21ella 

Sequgreher" 1 w 21ellraoefinal 1 ' 



9| AchrtQlell . #425 
9exbobl2c4<Sl._ >#1> 



#2431 



OGCACTAAGC CAGATGGCAG CCCGGATGAG CGCTGGTGCT TCAGGGTGGA GGAGGTGAAC TGGGCTTCAT GGGAGCAGAC GCTGCCTACG 

TGAT GOO AG GAG AC GCTGCCTACG 



OGCACTAAGC CAGATGGCAG CCCGGATGAG CGCTGGTGCT TCAGGGTGGA GGAGGTGAAC TGGGCTTCAT GGGAGCAGAC GCTGCCTACG 
GTKF DOS FOB RWCF RVE B V N W A fl W EQT LPT 



| Achlb21oll „ #515 CTGTGTGAGG ACCCGTCAGG GGCAGGTGTC CCTCGAACTC TCGAGAACCC TGTCCTGGCT TCCCCTCCCA AGGAGGATGA GGATGGTGCC 
Urt**>12c4dl.a~ #25 CTGTGTGAGG ACCCGTCAGG GGCAGGTGTC CCTCGAACTC TCGAGHACCC TGTCCTGGCT TCCCCTCCCA AGGAGGATGA GGATGGTGCC 



#2521 CTGTGTGAGG ACCCGTCAGG GGCAGGTGTC CCTCGAACTC TCGAGAACCC TGTCCTGGCT TCCCCTCCCA AGGAGGATGA GGATGGTGCC 
L C E D P S G A G V PRTL E N P VLA S P P K E D E D G A 

I ACbrb21ell _ #605 TCTG AGG AAA ACTATGTGCC CGTCCAGCTC CTCCAGTCCA ACTGATGGCC CAGATGCAGC AGGAGGCCAG AGG AC AG AG C AGAGGATCTT 
|frh0fol2c4dl.- #115 TCTG AGG AAA ACTATGTGCC CGTCCAGCTC CTCCAGTCCA ACTGATGGCC CAGATGCAGC AGGAGGCCAG AGGACAGAGC ANAGGATCTT 

#2611 TCTG AGG AAA ACTATGTGCC CGTCCAGCTC CTCCAGTCCA ACTGATGGCC CAGATGCAGC AGGAGGCCAG AGGACAGAGC AGAGGATCTT 
SEEN YVP V Q L L Q S N , W P R C S RRPE DRA E D L 

+ 

j Acbxb21ell „ #695 TCCAACCACA TCTGCTGGCT CTGGGGTCCC AGTGAATTCT GGTGGCAAAT ATATATTTTC ACTAACTCAA AAAAAAAAAA AAAAA 
Jfrtl6bl2c4<ll.- #205 TCCAACCACA TCTGCTGGCT CTGGGGTCCC AGTGAATTCT GGTGGCAAAT ATATATTTTC ACTAACTCAA AAAAAAAAAA AAAAAAAAAA 



#2701 TCCAACCACA TCTGCTGGCT CTGGGGTCCC AGTGAATTCT GGTGGCAAAT ATATATTTTC ACTAACTCAA AAAAAAAAAA AAAAAAAAAA 
SNHZ C W L HOE SEFW W Q I Y I F TUSK K K K K K K 



!CHiabl2c4(XL. w #295 AAAAAAAAAA AAAAAAAAA 



#2791 AAAAAAAAAA AAAAAAAAA 
K K K K K K 



BLASTP vs. PNU (AA) f lh21ellorf aa - 5:26:57 pm on Dec 21 

BLASTP 1.4.10MP-WashU [30-Aug-96] (Build 20:24:58 Oct 21 1996] 

Reference: Altschul, Stephen F,, Warren Gish, Webb Miller, Eugene 
and David J. Lipman (1990). Basic local alignment search tool. J 
215:403-10. 

Query= f lh21ellorfaa 
(764 letters) 

Database : /diskQ /asap/databases /db/NRP/protxnu 

345,099 sequences; 106,000,194 total letters- 
Searching done 



Sequences producing High-scoring Segment Pairs: 



GP:gi 
GPU:gi 
GP:gi' 
GP:gi 
GP:gi 
GP:gi 
GP:gi 
GP:gi 
SP:SP: 
GP : gi I 



2570933 
|3986159 
2911863 
3675319] 
2642590 
2854143 
3879753 
3287188 
sp|P4899 
1841966 



(AF029310) vanilloid receptor subtype 1 [Ra. 
|gnl|PID|dl035925 (AB015231) VRlsk [Mus muB. 
(AF047 66I0) contains similarity to ankyrin r. 
gnl|PID|el3 44970 (274030) similar to ankyri . 
(AF031408) olfactory channel [Caenorhabditi , 
(AF045639) contains similarity to ankyrin r. 



gnl PID|el349345 (Z72514) Similarity to Hum- 
gnl PlD|e315126 (Y10601) ankyrin-like prote. 

4|trpl_drome transient-receptor-potential l. 

(U65916) ankyrin [Rattus norvegicus] 



98 



W . Myers , 
Mol. Biol. 





Smallest 






Sum 




High 


probability 


Score 


P(N) 


N 


336 


l,9e-221 


7 


299 


9.8e-136 


6 


103 


6.5e-20 


8 


97 


l.le-19 


7 


93 


3 .6e-16 


6 


93 


4.1e-16 


6 


73 


3 . 6e-ll 


6 


71 


2 -7e-08 


6 


69 


2.2e-05 


4 


72 


5.6e-05 


3 



WARNING : Descriptions of 4 database sequen.ras were not reported due to the 
limiting value of parameter V = 11 . 



>GP:gi (2570933 (AF029310) vanilloid receptor subtype 1 [Rattus norvegicus] 
Length = 838 

Score = 336 (156.0 bits). Expect ^ 1.9e-221, Sum P(7] = l-9e-221 
Identities = 67/112 (59%), Positives = 84/112 (75%) 



Query: 
Sbjct : 
Query: 
Sbjct: 



206 FYFGELPLSLAACTKQWIWVSYLLENPHQPASI^ATDSQGNTVLHALVMISDNSAENIAL 265 

FYFGELPLSLAACT Q +V +LL+N QPA + A DS GNTVLHALV ++DN+ +N 
245 F YFGELPLS LAAC TNQ LA I VKF LLQNS WQF AD I SARDS VGNTVLHALVEVADNTVDNTKF 304 



266 VTSMYDGLLQAGARLCPTVQLEDIRNLQDLTPLKLAAKEGKIEIFRHILQRE 
VTSMY+ +L GA+L PT++LE+1 N + LTPL LAA GKI + +ILQRE 
305 VTSMYNEILtLGAKLHPTLKLEEITNRKGLTPLALAASSGKIGVLAYILQRE 



317 
356 



Score = 316 (146.7 bits)/ Expect = 1.9e-22^. Sum P(7) = 1.9e-221 
Identities = 65/138 (47%), Positives = 93/:,38 (67%) 

Query: 66 ASQPDPNRFDRDRLFNAVSRGVPEDI^GLPEYLSKTSKY^ 125 

A + P +DR +F+AV++ ++L L : +L ++ K LTDSE+ + TGKTCL+KA+L 
Sbjct: 104 AGEKFPRLYDRRSIFDAVAQSNCQELESLLPFLQRSKKRLTDSEFKDPETC^ 163 

Query: 126 NLKDGVNAC ILPLLQI DRDSGWPQPLVNAQCTDDYYRGHSALHI AIEKRS LQQVKLLVEN 185 

NL+GN I LL + R + + + VNA TD YY+G +ALHIAIE+R-H- |V LLVEN 
Sbjct: 164 NLHNGQNDT I ALLLDVARKTDS LKQ FVNAS V5TDS YYKGQTALH I AI ERRNMT EjVTLLVEN 223 

Query; 186 GANVHARACGRFFQKGQG 203 



I 



GA+V A A G PF+K +G 
Sbjct: 224 GADVQAAANGDFFKKTKG 241 

Score = 299 (138.8 bits), Expect = 1.9e-221, Sum P(7) = l_9e-221 
Identities = 69/175 (39%), Positives = 94/175 (53%) 



Query: 
Sbjct: 
Query: 
Sbjct : 
Query: 
Sbjct ; 



391 FFLNFLCNLIYMFlFTAVAYHQPTLKKAAPHIiKAEVGNSMLLTGHILILLGGDCYLLVGQL 450 



F+ NF 



+YM IFTA AY++P 



LK VG+ 



+TG IL 



434 FYFNFFVyCLYMIIFTAAAYYRPVEGLPPYKLKNTVGOT 493 
451 WYFWRRHVTIWISFIDSYFEILFLFQALLKPVSQVLCFIA^ 510 



YF +R 



F+DSY EILF Q+L VS VL F + Y+ +V +L 



+GW N+L 



494 QYFLQRRPSLKSLFVDSYSEILFFVQSLFi'LVSVvX^ 553 

511 YYTRGFQHTGIYSVMIQKVILRDLIJIFLLIYLVFLFGFAVALVSLSQEAWRPEAP 565 

YYTRGFQ GIY+VMI+K+ILRDL R A+V+L ++ | P 

554 YYTRGFQQMG I YAVMI EKMILRDLCRXXXXXXXKXXXXXTAWT L I EPGKNNSL F 608 



Score = 281 (130.5 bits). Expect = l-9e-221, Sum P(7) = 1.9e-221 
Identities = 54/95 (56%), Positives = 70/95 (73%) 

Query: 586 GAQYRGILEASLELFKFTIGMGELAFQEQLHFRGMVT/LLLLA^ 645 

G Y + LELFKFTIGMG+L F E F+ + ++LLIiAYV+LTYILLLNMLIALM 
Sbjct: 624 GNSYNSLYSTCLELFKFTIGMGDLEFTENYDFKAWIIL^ 683 

Query: 646 ETVNSVATDSWS IWKLQKAISVLEMENGYWWCRKK 680 

ETVN +A +S +IWKLQ+AI++L+ E C +K 

Sbjct: 684 ETV^IAQESKNIWKLQRAITILOTEKSFLKCMRK 718 

Score =.162 (75.2 bits). Expect = 1.9e-221, Sum P(7) = 1.9e-221 
Identities = 28/38 (73%), Positives = 34/:* 's (89%) 

Query: 323 HLSRKFTEWCYGPVRVSLYDLASVDSCEENSVLEIIAF 360 

HLSRKFTEW YGPV SLYDL+ +D+CE+NSVLE+IA+ 
Sbjct: 364 HLSRKFTEWAYGPVltSSLYDLSCIDTCEKIiSvXEVIAY 401 

Score = 144 (66.9 bits), Expect = 1.9e-221, Sum P(7) = 1.9e-221 
Identities = 25/47 (53%), Positives « 31/47 (65%) 

Query: 678 RKKQRAGVMLTVGTKPDGSPDERWCFRVEEVNWASWEQTLPTLCEDP 724 

RK R+G +L VG PDG D RWCFRV+EVNW +W + + EDP 
Sbjct: 717 RKAFRS GKLLQVGFTPIXjKDDY11WCFRvI)E VNWTTWNTNVG I INED P 763 



Score = 94 (43.6 bits), Expect = 1.96-221, Sum P(7) 
Identities = 15/30 (50%), Positives = 24/30 (80%) 

Query: 3 63 KSPHRHRMWLEPIiNKLLQAKWDLLIPKFF 392 

++P+RH M+++EPLN+LLQ KWD + + F 
Sbjct: 405 ETPNRHDMLLVEPLNRl^l^DKl/TORFVKRIF 434 

Score = 46 (21.4 bits), Expect = 1.3e-162. Sum P(6) 
Identities = 11/48 (22%), Positives = 24/43 (50%) 



Query; 



= 1.9e-221 



= 1.3e-162 



435 HILILLGG I YLL VGQLWYFWRRHVF I WI S T IDS YFE I LFLFQ ALUCPV 482 
H + L+ W + + +F + PJ- + I+F A +PV 

Sbjct: 410 HDMLLVEP LNRLLQDKWDRFVKRI FYFNFF VYCLYM 1 1 FT AAAYYRPV 457 

Score = 37 (17.2 bits), Expect = 2.4e-169, Sum P{7) = 2.4e-169 
Identities = 8/22 (36%), Positives = 13/22 (59%) 

Query: 297 PLKLAAKEGKIEIFRHII>QREF 318 

PL LAA ++ I + +LQ + 
Sbjct: 251 PLSLAACTNQLAIVKFLLQNSW 272 



Score = 37 (17.2 bits). Expect = 1.9e-lll, Sum P(7) = 1.9e-lll 
Identities = 8/26 (30%), Positives = 15/26 (57%) 



Query; 206 FYFGELFLSLAACTKQWDVVSYLLEN 2 31 

+Y G+ L +A + +V+ L+EN 
Sbjct: 198 YYKGQTALHIAIERRNMTLVTLLVEN 223 

>GPU:gi| 3986159 |gnl|FID|dl035925 (AB015231) VRlsk [Mus musculus] 
Length = 563 1 

Score = 299 (138.8 bits). Expect = 9.8e-136, Siim P(6) = 9.8e-136 
Identities = 69/175 (39%), Positives = 94/175 (53%) 



Query; 
Sbjct: 
Query; 
Sbjct: 
Query: 
Sbjct; 



391 FFLNFLCNLIYMFIFTAVAYHQPTLKKAAPHLKAEVGNSMLLTGH^ 450 



F+ NF 



+YM IFTA AY++F 



LK VG+ +TG IL + 



127 FYFNFFVYCLYWIIFTAAAYYRPVEGLPPYKLKNTVGDYFRVTGEI^ 186 
451 WYFWRRHVF IWI SF I DS YFE I LF LFQALLKPVSQ VLCFLAI EWYL PLLVSALVjLGWLNLL 510 



YF +R 



F+DSY EILF Q+L VS VL F + y+ +V +L 



187 QYFJ^RRPSLKSLFVDSYSEILFFVQSLFMLVSWLY^ 246 
Sll YYTRGFQHTtelYSVMIQKV"ILRDLLRFLLlYLVFLFGFAVALVSLSQEAWRPE?VP 565 



YYTRGFQ GIY+VMI+K+ ILRDL R 



A+V+L ++ 



4-GW N+L 



247 YYTRGFQQMGIYAVMIEKMILRDLCRXXXXXXXXXXXXXTAWTLlEDGKNNSfljP 301 



Score = 253 {117.5 bits), Expect = 9.8e-136, Sum P(6) = 9.8e-136 
Identities = 50/89 (56%). Positives = 64/89 (71%) 

Query: 586 GAQ YRG I LEAS LELFKFTI GMG ELAFQEQLHFRG^LLLLLAYVLLTY I LLLNML I ALMS 645 

G Y + LELFKFTIGMG+L F E F+ + ++LLLAW+LTYILLLNJ*LIALM 
Sbjct: 317 GNSYNSLYSTCLELFKFTIGMGDLEFTEKnraFKAVFIILLLAWI^ 37 6 

Query; 646 ETVNS VATDSWS IWKLQKAIS VLEMENGY 674 

ETV V+ +S IWKLQ A ++L++E + 
Sbjct:: 377 ETVGQ VS KES KHI WKLQWATT I LD I ERS F 405 

Score = 163 (75-7 bits), Expect = 9.8e-136, Sum P(6) = 9.8e-136 
Identities = 28/47 (59%). Positives = 34/47 (72%) 

Query: 678 RKKQRAGVKLTVGTKPDGSPDERWCFRVEEVNWASWEQTLPTLCEDP 724 

RK R+G M+TVG DG+PD RWCFRV+EVNW+ W Q L + EDP 
Sbjct: 410 RKAFRSGEMVTVGKSSDGTPDRRWCFRVTDE^TNWSHWNQNLGIINEDP 456 

Score = 162 (75.2 bits), Expect = 9.8e-136, Sum P(6) = 9.8e-136 
Identities = 28/38 (73%), Positives = 34/3 * (89%) 

Query: 323 HLSRKFTEWCYGPVRVSLYPIASVDSCEENSVLEIIAF ^3 60 

HLSRKFTEW YGPV SLYDL+ +D+CE+NSVLE+IA+ 
Sbjct: 57 HLSRKFTEWAYGPVHSSLYDLSCIDTCEKNSVLEV1AY 94 

Score = 129 (59-9 bins), Expect = 9,8e-136, Sum P(6) = 9.8e-136 
Identities = 27/49 (55%), Positives = 36/49 (73%) 

Query: 269 MYTCLLQAGARLCPTVQLEDIRNLQDLTPLKLAAKEGKIEIFRHILQRE 317 

MY+ +L GA+L PT++LE+I N + LTPL LAA GKI + +ILQRE 
Sbjct: 1 MYNEILILGAKLHPTLKLEEITNRKGLTPLALAASSGKIGVLAYILQRE 49 

Score = 94 (43.6 bits), Expect = 9.8e-136, Sum P(6) = 9.8e-136 
Identities = 15/30 (50%), Positives = 24/30 (30%) 



Query: 363 KSPHRHRMVVLEPLNKLLQAKWDLLIPKFF 392 



++P+RH M+++EPLN+LLQ KWD + + F 
Sbjct: 96 ETPNRHDMLLVEPLNRLLQDKWDRFVKRir 127 

Score = 46 (21.4 bits), Expect - 2.3e-77, 5um P(S) = 2.3e-77 
Identities - 11/48 (22%), Positives = 24/48 (50%) 

Query: 435 HILILLGGIYLLVGQLWYFWRRHVFIWISFIDSYFEILFLFQALLKPV 482 

H + L+ W + + +F + F+ + I+F A +PV 

Sbjct: 103 HDMLLVEPLNRLLQDKWDRFVKRIFYFNFFVYCLYMI 150 

Score = 42 (19.5 bits). Expect = 1.2e-122, Sum P(6J = 1.2e-122 
Identities = 9/22 (40%) , Positives = 16/22 (72%) 

Query; 209 GELPLSLAACTKQWDWSYLLE 23 0 

G PL+LAA + + V++Y+L+ 
Sbjct: 26 GLTPLALAASSGKIGVLAYILQ 47 



>GP--gi|2911863 (AF047660) contains similarity to ankyrin repeats 
[Caenorhabditis elegans] 
Length =900 

Score = 103 (47. B bits), Expect = 6.5e-20 ; Sum P(8) = 6.5e-20 
Identities = 23/89 (25%), Positives = 44/39 (49%) 

Query- 497 liVSALVLGWIM^ 556 

L+ ++ ++ LYY R + G + +M+ +1 D+ RF+LIY +FL GF+ + + 
Sbjct: 592 LITWMIFTTVHYLYYCRVIRFVGPFVLWYTIIA 651 

Query: 557 QEAWRPEAPTG PNATESVQPMEGQEDEGN 5S5 

R T+ + EG +++ N 

Sbjct: 652 LSCEREAltfVI KKL ITDQSEAS EGS DNKFN 680 

Score = 63 (29.3 bits). Expect = 6.5e-20, Sum P{8) = 6.5e-20 
Identities = 15/27 (55%), Positives = 18/27 (66%) 

Query: 291 NLQDLTPLKIAAKEGKIEIFRHILQRE 317 

N Q L+PL LAAK K E4-F IL+ E 
Sbjct; 328 NKQSLSPLTLAAKLAKKEMFDEILELE 354 

Score = 56 (26,0 bits). Expect: = 6.5e-20 f Sum P<8) = 6.5e-20 
Identities = 16/48 (33%), Positives = 25/^ (52%) 

Query: 151 LVNAQCTDDYYKGHSALHlAIEKRSLQCVt.LLVENGANVH^ 198 

L+N + + G S LH AI + V. ++ GA+V++R G FF 

Sbjct: 185 LLNDIHI SEDF YGLS PLHQAI XNTDCKL V^KFLKLGADVNSRCYGAFF 232 

Score = 54 (25.1 bits), Expect = 6.5e-20, Sum P(8) = 6.5e-20 
Identities « 12/23 (52%), Positives = 13/23 (56%) 

Query: 207 YFGELPLSI-AACTKQWDWSYLL 229 

Y GE PLS AAC Q + LL 
Sbjct: 263 YLGEYPLSFAACLNQPESFRLLL 285 

Score = 49 (22.8 bits), Expect = 6,5e-20, Sum P(8) = 6-5e-20 
Identities = 8/23 (34%), Positives * 12/23 (52%) 

Query: 331 WCYGPVRVSLYDLASVDSCEENS 353 

W YG + Y LA +D+ E + 
Sbjct: 359 WAYGDASSTAYP LAKIDT INETT 381 



Score = 47 (21.8 bits). Expect = 6.5e-20. .*3um P(8) = 6.5e-20 
Identities = 11/30 (36%), Positives = 18/i:J (60%) 



I 

I 



Query: 23 6 AS LQ ATDS QGNTVLHAL VM I S DNSAENIAL 265 

A+ A D+ GN+VLH V+ + + +AL 
Sbjct: 289 ANPNAQDTNGNSVLHMCVIHENMAMFKLAL 318 

Score = 46 (21.4 bits), Expect = 5.7e-13, Sum P(6) = 5.7e-13 
Identities = 9/27 (33%), Positives = 17/27 (62%) 

Query; 163 GHSALHIAIEKRSLQCVXLLVENGANV 189 

G+S LH+ + ++ KL +E GA++ 
Sbjct: 298 GNSVLBMCVIHENMAMFKLALECGASL 324 

Score = 43 (20.0 bits], Expect = 6.5e-20, Sum P(B) = 6,5e-20 
Identities = fi/18 (44%), Positives = 13/18 (72%) 

Query; 634 ILLLNMLIAIrfMSETVNSV 651 

1+ NMLIA+M+ T ++ 
Sbjct: 753 IMQFNMLIAMMTRTYETI 770 

Score = 41 (19-0 bits). Expect = 6.5e-20, Sum P(8) = 6.5e-20 
Identities = 10/27.(37%). Positives = 16/27 (59%) 

Query: 113 GSTGKTCLMKAVLNLKDGVNACILPLL 139 

GS G+T + +L+ D NA +L +L 
Sbjct: 153 GSMGETIIGCCLLHASDIHNALVIiKIL 179 

Score = 35 (16.3 bits). Expect = 0.00016, Sum P(3) = 0.00016 
Identities = 9/37 (24%), Positives = 16/37 (43%) 



Query: 67 SQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSK 103 

S+P P R+ ++ V + GL E+ S+ 

Sbjct: 464 SEPFPGRYGKNSTLQQVKPVINATSRGLVEWSEPLSQ 500 

Score = 34 {15.8 bits). Expect = 6.8e-ll, Sum P(8) = 6.8e-ll 
Identities = 8/31 (25%), Positives = 17/31 (54%) 

Query; 422 LKAEVGNSMLLTGHILILLGGIYLLVGQLWY 452 

L ++ +L+ ++LI + 1+ V L+Y 
Sbjct: 577 I^CDLSPVLLVVBNVI.ITVlMIFTTVHYLYi; 607 



>GP;gi| 3875319 |gnl| FID |el344970 (Z74030) similar to ankyrin repeats 

[Caenorhabditis elegans] >GP: gi | 3876480 | gnl | PID | el3461j7 2 (Z72508) 
similar to ankyrin repeats [Caenorhabditis elegans] 
Length = 790 

Score = 97 (45.0 bits), Expect = l.le-19, Sum P(7) = l.le-19 
Identities = 18/53 (33%), Positives = 32/53 (60%) 



Query: 497 LLVSALVLGWLNLLYYTRG 

+ + +++L + LYY R G + +M+ +1 DL+RF +IY +FL GF+ 

Sbjct; 526 MAI IS ILLVTQHFLYYMRAI PFVGPFVLMVYTI IATDLVRFAMI YSIFLVGFS 



Score = 89 (41,3 bits), Expect « l.le-19, Sum P(7) 
Identities = 19/48 (39%), Positives = 27/48 (56%) 



l.le-19 



Query; 151 LVNAQCTDDYYRGHSALHIAIEKRSLQCVKLLVENGANVHARACGRFF 198 

L+N C + Y G S LH+Al + Q LL+ GA+++ R G FF 
Sbjcr: 189 LIOTICVSEEYYGLSPLHIAIWQDAQFTSVLiLRLGADIiNQRCYGAFF 236 
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578 



Score = 62 (28.6 bits), Expect = l.le-19. Sum P(7) = l.le-19 
Identities = 13/23 (56%), Positives = 14/23 (60%) 



Query; 207 YFGELPLSLAACTKQWDWSYLL 229 

YFGE PLS AC Q D+ LL 
Sbjct: 267 YFGEYPLSFAICTfGQHDLFRMLL 289 

Score = 51 (23.7 bits), Expect = l.le-19, Sum P(7) = l.le-19 
Identities = 10/19 (52%), Positives « 13/19 (68%) 

Query: 236 ASLQATDSQGNTVLKALVM 254 

A+L A D+ GNT LH V+ 
Sbjct: 293 ANLSAQDTNGNTALHLCVI 311 

Score = 49 (22.8 bits-), Expect = l.le-19. Sum P(7) = l.le-19 
identities = 11/51 (21%), Positives - 27/51 (52%) 

Query: '601 KFTIGMGELAFQEQLHFRGMVLLLLIJm/LLTYILLL^^ 651 

+F+ + E++ + + + L+ + + IL N+LIA+M+ T ++ 

Sbjct: 626 EFSVLYRE^SACDNFWMKWlGKLIFVIFETi'VSILQFNLLIAMMTRTYETI 676 

Score = 43 (20.0 bits), Expect - 1.2e-06, Sum P(5) = l,2e-06 
Identities = 7/27 (25%), Positives ~ 16/27 (59%) 

Query: 163 GHSALHIAIEKRSLQCVKLLVENGANV 189 

G++ALH+ + + + ++E G N+ 
Sbjct; 302 GNT ALHLCVT HDKMDMLDAVLEAGGNI 328 

Score = 39 (18.1 bits), Expect = 5-5e-12, Sum P(7) = 5,5e-12 
Identities = 12/56 (21%), Poaitives = 25/56 (44%) 



Query: 
Sbjct: 



461 WI S F I DS YFE I LFliFQ ALLKP VSQVXjCFIAIEV^LPLL VS AL VLGWLNLLYYTRGF 516 



W +F+ ++ 



A If 



+ C LA ++ 



+++L F 



483 WFNFLKAFPAKLMFKGAFLF 1 1 IS IPCRLACSFHEFFLTIDNTMAl IS I LLVTQHF 538 



Score - 38 (17.6 bits), Expect - l.le-19, Sum P(7) = l.le-19 
Identities = 9/18 (50%), Positives = 12/18 .{66%) 

Query: 289 IHNLQDLTPLKLAAKEGK 306 

+ N Q+LT L LAA+ K 
Sbjct: 330 LANKQNLTALTLAARLAK 347 

Score = 37 (17.2 bits), Expect = 4.7e-18, Sum P(7) = 4.7e-18 
Identities = 7/15 (46%), Positives = 9/15 (60%) 

Query: 723 DPSGAGVPRTLENPV 737 

DP G+ +ENPV 
Sbjct: 599 DPMG S EFNNTMENPV 613 

Score - 36 (16,7 bits), Expect = l.le-19, Sum P(7) = l.le-19 
Identities = 10/d7 (21%), Positives = 16/47 (34%) 

Query; 376 LNKLLQAKWDLLIPKFFLNFLC^IYMFIFTAVAYHQPTLKKAAPHL 422 

+ ++L KW +L L 1+ + AY HL 

Sbjct: 362 IEQILDEKWKAYGRALWLRSLLGFIFF^CCFVCAYMLRPSSATTEHL 408 

Score = 35 (16.3 bitsj , Expect = 3.0e-08, fvm P(6) = 3.0e-08 
Identities « 9/23 (39%), Positives = 11/23 ;47%) 

Query: 207 YFGELPLSLAACTKQWDWSYLL 229 

Y+G PL LA + S LL 

Sbjct: 199 YYGLSPLHLAIVNQDAQFTSLLL 221 

Score « 34 (15,8 bits), Expect = 3.9e-12, Sum P(5) = 3.9e-12 
Identities = 7/21 (33%), Positives = 13/21 (61%) 



Query: 297 PLKLAAKEGKIEIFRRTLQRE 317 

. PL A G+ ++FR +L ++ 
Sbjct: 272 PLSFAICMGQHDLFRMLLAKK 292 

Score = 34 (15.8 bits), Expect = 8.9e-10, Sum P(7) = 8.9e-10 
Identities = 6/18 (33%), Positives = 12/18 (66%) 

Query: 610 AFQEQLHFRGMVLLLLLA 627 

AF +L F+G L ++++ 
Sbjct: 489 AF P AKLMFKGAF LFI I I S 506 

Score = 34 (15-8 bits), Expect = a.Oe-OS, Sum P(6) = 4.0e-08 
Identities = 11/40 (27%) # Positives = 16/40 (40%) 

Query; 153 NAQCTDDYYRGHSALHIAIEKRSLQCVKLLVENGANVHAR 192 

N T Y G LAI ++L+ AN+ A+ 

Sbjct: 259 NTNYTG SMYFGEYPLS F AI CMGQHDLFKMLLAKKANLS AQ 298 



>GP:gi (2642590 (AF031408) olfactory channel [Caenorhabditia elegaris] 
Length =937 

Score = 93 (43.2 bits), Expect = 6.8e-16, Sum P(5) = 6.8e-l6 
Identities = 19/36 (52%), Positives = 25/36 (69%) 

Query: 163 GHSAI^IAIEKRSLQCVKLLVENGANVHARACGRFF 19 fl 

G SALH+AI + V LL+ + A+V+ARACG FF 

Sbjct: 172 GQSALHLAIVHDDYETVSLLLNSKADVNARACGNFF 207 

Score = 92 (42.7 bits), Expect = 6.8e-16, Sum P{5) = 6.8e-16 
Identities = 20/53 (37%), Positives = 31/53 (58%) 



Query: 497 LLVSALVLGWLNLLYYTRGFQHTGIYSVMIQKVILRDLLI^LLIYLVFLFGFA 
L V AL W+ LL+ + R + TG + MI +1 D++RF +1 +FL F+ 
Sbjct : 500 LFWALFGSWIFLLFFARSAKLTGPFVQMIYSMIAGEnyilRFAIISAIFLVSFS 

Score = 57 (26.5 bits). Expect = 6.8e-l6, Sum P(5) = 6.8e-16 
Identities = 10/24 (41%), Positives = 15/24 (62%) 

Query: 207 YFGELPLSLAACTKQWDVVSYLLE 2 30 

Y+GE PL+ AAC D+ L++ 
Sbjct; 226 YYGEYPLAFAACFGNKDlYDLLIQ 249 

Score = 50 (23,2 bits). Expect = 6.8e-16, Sum P{5) = 6.8e-16 
Identities = 11/25 (44%), Positives = 15/25 (60%) 

Query; 291 NLQDLTPLKLAAKEGKIEIFRHILQ 315 

N TPL LA K G+ +IF +L+ 
Sbjct: 294 NHAGFTF LTLATKLGRKQ I FEEMLE 318 

Score = 44 (20,4 bits). Expect = 6,8e-l6, Siim P(5) = 6.8e-16 
Identities = 10/24 (41%), Positives = 15/24 (62%) 

Query: 242 DSQGNTVLHALVMISDNSAENIAL 2 65 

DS GNT+LH V+ . +S + A+ 
Sbjct: 259 DSFGNTILHMCVINYSSSMYSYAV 281 

Score = 37 (17.2 bits). Expect = 2.2e-05, Sam P(4) = 2.2e-05 
Identities = 9/24 (37%), Positives = 13/24 (54%) 
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Query: 2 08 FGELPLSLAACTKQWDWSYLLEN 231 

FG+ L LA ++ VS LL + 

Sbjct: 171 FGQSALHLAIVHDDYETVSLLLNS 194 



Score = 34 {15.8 bits). Expect = 3.6e^l6, Sum P(6) = 3.6e-l6 
Identities = 7/21 (33%), Positives * 11/21 (52%) 

Query: 364 SPHRHFMVVLEPLTIKLLQAKW 384 

+ P M+ E + +LL KW 
Sbjct: 3 63 TPEHLDMIGSEVIQRLLADKW 383 



>GP:gi ] 2854148 (AF045639) contains similarity to ankyrin repeats 
[Caenorhabditis elegans] 
Length = 957~ 

Score = 93 (43.2 bits). Expect = 7.6e-l6, Sum P{5) = 7,6e-16 
Identities = 19/36 (52%), Positives = 25/36 (69%) 

Query; 163 GHSALHIAIEKRSLQCVKLLVENGANVHARACGRFF 198 

G SALH+AI + V LL+ + A+V+ARACG FF 

Sbjct: 175 GQSALHLAIVHDDYETVSLLLNSKADVNARACGNFF 210 

Score = 92 {42.7 bits), Expect = 7.6e-16, Sum P{5) = 7.6e~16 
Identities = 20/53 (37%), Positives = 31/53 (58%) 

i 

Query: 497 LLVSALVI/SWL^LYYTRG 

L YAL W+ LL++ R + TG + MI +1 D++RF +1 +FL F+ 
Sbj ct ; 503 LFVFALPGSWIFLLFFARSAKLTGPFVQMIYSMIAGDMIRFAIISAIFLVSFS 

Score = 57 (26.5 bits). Expect = 7.6e-l6, S,im P(5) = 7.6e-16 
Identities = 10/24 (41%), Positives = 15/2-? (62%) 

Query: 207 YFGELPLSLAACTKQWDWSYLLE 230 

Y+GE PL+ AAC D+ L++ 
Sbjct: 229 YYGEYPLAFAACFGNKDIYPLLIQ 252 

Score = 50 (23.2 bits). Expect = 7.6e-16, Sum P(5) = 7.6e-16 
Identities = 11/25 (44%), Positives = 15/25 (60%) 

Query: 291 NLQDLTPLKLAAKEGKIEIFRHILQ 315 

N TPL LA K G+ + IF +L+ 
Sbjct: 297 NHAGFTPLTLATKLGRKQIFEEMLE 321 

Score = 44 (20.4 bits). Expect = 7.6e-l6, Sum P(5) = 7.6s-l6 
Identities = 10/24 (41%), Positives =15/24 (62%) 

Query: 242 DSQGNTVLHALVMISDNSAENIAL 265 

DS GNT-t-LR V+ +S + A+ 
Sbjct: 261 DS FGNTI LHMCVINYS S SHYS YAV 284 

Score - 37 (17.2 bits). Expect = 2.4e-05, Sum P{4) = 2.4e-05 
Identities - 9/24 (37%), Positives = 13/24 (54%) 

Query: 208 FGELPLSLAACTKQWDWSYLLEN 231 

FG+ L LA ++ VS LL + 

Sbjct: 174 FGQSALHLAIVHDDYETVSLLLNS 197 

Score = 34 (15.8 bits). Expect = 4.1e-16, Sum P(6) = 4.1e-16 
Identities = 7/21 (33%), Positives = 11/21 (52%) 
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Query: 3 64 SPHRHRMVVLEPLNKLLQAKW 384 

+P M+ S + +LL KW 
Sbjct: 366 TPEHLDMIGSEVTQRLLADKW 386 



I 



>GP:gi|3879753|gnl|PID|el349345. (Z72514) Similarity to Human ankynn 
(SW:ANKl_HUMAN) [Caenorhabditis elegans] 
Length =519 
i 

Score = 73 (33.9 bits;) , Expect = 3.6e-ll r Sum P{6) = 3.6e-ll 
Identities = 17/36 (47%), Positives = 19/36 (52%) 

Query; 163 GHS ALHI AI EKRS LQ CVTCLL VENGANVHARACGRFF 198 

G S LH AI L+ V L GA+VH R G FF 
Sbjct: 186 GLSPLHQAIVNEDLEMVYFLCRKGADVHQRCYGSFF 221 

Score = 63 (29.3 bits). Expect = 3.6e-ll, Sum P{6) = 3.6e-ll 
Identities = 12/17 (70%), Positives « 13/17 (76%) 

Query: 207 YFGELPLSLAACTKQWD 223 

Y+GE PLS AACT Q D 
Sbjct: 252 YWGEYPLS FAACTNQVD 268 

Score = 52 (24.1 bits). Expect = 3.6e-ll, Sum P£6) = 3.6e-ll 
Identities = 13/27 (46%), Positives = 17/27 (62%) 

Query: 289 IRNLQDLTPLKLAAKEGKIEIFRHILQ 315 

+ RN LTFL LAA+ K 1+ IL+ 
Sbjct: 315 TONT^KLTPI^LAARI^KKHIYDLILE 341 

Score = 51 (23.7 bits). Expect = 3.6e-ll, Sum P(6) = 3.6e-ll 
Identities = 10/22 (45%). Positives = 12/22 (54%) 

Query: 331 WCYGPVWSLYDLASVDSCEEN 352 

W YGPV Y L VD+ E+ 
Sbjct: 348 WRYGPWCKAYPLNDVDTINES 369 

Score - 43 (20_0 bits). Expect = 3.6e-ll, Sum P(6) = 3.6e-ll 
Identities = 8/13 (61%), Positives = 10/13 (76%) 

Query: 242 DSQGNTVLHALVM 254 

D+ GNTVLH V+ 
Sbjct: 284 DTNGNTVLHLTVI 296 

Score = 37 {17.2 bits), Expect = 3-6e-ll, Sum P{6) - 3-6e-ll 
Identities = 8/35 (22%), Positives = 16/35 (45%) 

Query: 376 LNKLLQAKWDLL I PKF F LNFLCNLI YMF I F'? HVAY 410 

+ ++L++KW+ K L IY +A+ 

Sbjct: 398 IEEVIjESKWETFGKKQLFMSIAGYIYFIAWYIAF 432 

>GP;gi|3287188|gnl|PID|e315126 (Y10601) ankyrin-like protein [Homo 
Length = 1119 

Score = 71 (33.0 bits). Expect = 2.7e-08, Sum P(6) = 2.7e~08 
Identities = 13/30 (43%), Positives = 20/30 (66%) 

Query: 163 GHS ALH I A I EKRS LQCVKLLVENGANVHAR 192 

G++ LH A+fcK ++ VK L+ GAN + R 
Sbjct : 98 GNTPLHCAVEKNQIESVKFLLSRGANPNLR 127 

Score = 63 (29.3 bits') , Expect = 2.7e-08, Sum P(6) = 2.7e-08 
Identities =■ 10/30 (33%), Positives = 20/30 (66%) 

Query: 501 ALVLGWIJ^YYTRGFQHTGIYSvMIQKVI 530 

A+ W+N L Y + F++ GI+ VM++ ++ 
Sbjct: 838 AVYFYWMNFLLYLQRFENCGIFIVMLEVIL rt67 



sapiens] 



Score = 59 (27.4 bits), Expect = 9,0e-07, Sum P(6) = 9.0e-07 
Identities = 10/30 (33%), Positives = 19/30 (63%) 

Query: 167 LHI AI EKRS LQC VKLLVENG ANVHARACGR 196 

LH+A++ L+ +K+ ++NGA + GR 
Sbjct: 243 LHLAVQNGDLEMIKMCLDNGAQIDPVEKGR 272 

Score = 50 (23,2 bits), Expect = 2.7e^08, Sum P{6) = 2.7e-08 
Identities = 10/26 (38%), Positives = 18/26 (69%) 

Query: 291 NLQDLTFLKLAAKEGKIEIFRHILQR 316 

+L +TPL LAAK G ++ + +L++ 
Sbjct: 479 DLHGMTPLHLAAKNGHDKVVQIjLLKK 504 

Score = 49 (22,9 bits) , Expect = 2.7e-08, Sum P(6) = 2.7e-08 
Identities ~ 10/26 (38%) , Positives = 14/26 (53%) 

Query: 209 GELPLSLAACTKQWDVVSYLLENPHQ 234 

G PL LA + W++V+ LL Q 
Sbjct: 342 GRSPLILATASASWNIVNLLLSKGAQ 367 

Score - 48 (22.3 bits,). Expect = 3.2e-05, Sum P(5) - 3.2e-05 
Identities ^ 9/32 (26%), Positives = 20/32 (62%) 

Query: 524 VMIQKUTLRTILLRFLLIYLVFLFGFAVALVSL 555 

+++ +VIL+ LLR ++++ L F ++ L 
Sbjct; 860 IVMLEVILKTLLRSTWFIFLLLAFGLSFYIL 891 

Score = 41 (19.0 bits), E^ect = 2.7e-08, Sum P(6] = 2.7e-08 
Identities = 9/28 (32%), Positives = 15/28 (53%) 

Query: 525 MI QKVILRDLLRF LL IYLVF LFGFAVAL 552 

+ 1 K +LR +F+4-LF F + L 
Sbjct: 865 VILKTLLRSTWF IFLLLAFGLSFYI LL 892 

Score = 40 (18.6 bits). Expect = 2_9e-07, Sum P{5) = 2.9e-07 
Identities = 13/50 (26%), Positives = 24/50 (48%) 

Query: 620 lm^LLLAYVLLTYILLLNMLIAI^SET^ 669 

+V + LL L++ +LLN+ S ++■;■■ T S + + S LE 

Sbjct; 875 VWIFLLLAFGLSFYILLNLQDPFSSPLLSIIQTFSMMLA^ 924 

Score = 40 (18.6 bits'), Expect = 7.6e-05, iSum P(5) = 7.6e-05 
Identities * 8/19 (42%) , Positives = 12/1S (63%) 

Query: 296 TPLKLAAKEGKIEIFRHIL 314 

TPL LA + G +E+ + L 
Sbjct: 241 TPLHLAVQNGDLEMIKMCL 259 

Score = 39 (18.1 bits). Expect = 2.7e-08, Sum P(6) = 2.7e~08 
Identities = 7/19 (36%), Positives = 14/19 (73%) 

Query: 625 LLAYVLLTYILLLNMLIAL 643 

L+++ + I+L+N+LI L 
Sbjct: 941 LVSFTIFVPIVLMNLLIGL 959 

Score = 39 (18.1 bits.), Expect = 6.8e-07, Sum P(6) = 6.8e-07 
Identities = 9/36 (25%), Positives = 18/36 ;50%) 



Query: 
Sbjct: 



293 QDLTPLKLAAKEGKIEIFRHILQREFSGLSHLSRKF 326 

+ LT L 1 + +IE+ H + +E+ + L+ F 
681 E PLTALNAMVQNITRIE LLNHPVCKEYLLMKWLAYGF 716 




Qualified Target Summary Sheet 




Vanilloid Receptor Homologue 
(VR-2) 



Draggable Target Class 



Calcium channel 



MPI Gene Sequence Identifier: 



Top Blast Hit: 



Flh21ell (Mine 18560) 



Rat vanilloid receptor 1 (VR1) 



Amino Acid Coding Region: 



CDNA Length: 

Source (Tissue / Cell Line): 



ORF: 361-2652 



2806 bp 



First clone identified in an internal heart library 



Novel / Unrecognized: 
% Novelty: 



Patent Status: 



? Unrecognized / Novel 



66% novel across the complete cDNA. , . 

Hits unannotated sequence in Non-Public Patent Data Base 



Filed Nov. 1, 1998 



Full Length Clone: 



Yes 



Expression Profiling Results: 



Present in a sub-population of sensory neurons different from VR1. 
Also present in sympathetic neurons. 



Background: 


The published vanilloid receptor (Caterina et Al. Nature 389:816- 
24, 1997) responds to heat and capsaicin by activating Ca++ influx 
in sensory neurons (Tominaga eLal., Neuron 21: 531-43, 1998). 
Capsaicin also binds to this channel. 


Hypothesis: 


This channel may be responsible for hypersensitivity in chronic 
neurooathic pain and represents a unique target for pain. 


!•: . ^^^^M^^lv^^M^^fe 




| Assay Type: 


Cell-based assay 


| Reagents Needed: 


Open reading frame will be cloned by Millennium into pCDNA 3.1 


Readout: _ 




Critical Experiments Necessary: 
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